Overview
Brought to you by YData
Dataset statistics
| Number of variables | 100 |
|---|---|
| Number of observations | 601451 |
| Missing cells | 17024416 |
| Missing cells (%) | 28.3% |
| Total size in memory | 458.9 MiB |
| Average record size in memory | 800.0 B |
Variable types
| Text | 100 |
|---|
Dataset
| Description | Mammal NMNH Extant Specimen Records 0054884-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.dys66y |
license has constant value "CC0_1_0" | Constant |
publisher has constant value "National Museum of Natural History, Smithsonian Institution" | Constant |
collectionID has constant value "urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22" | Constant |
collectionCode has constant value "MAMM" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
occurrenceStatus has constant value "PRESENT" | Constant |
kingdom has constant value "Animalia" | Constant |
datasetKey has constant value "821cc27a-e3bb-4bc5-ac34-89ada245069d" | Constant |
publishingCountry has constant value "US" | Constant |
kingdomKey has constant value "1" | Constant |
protocol has constant value "EML" | Constant |
lastCrawled has constant value "2024-12-02T11:48:23.416Z" | Constant |
publishedByGbifRegion has constant value "NORTH_AMERICA" | Constant |
recordNumber has 50821 (8.4%) missing values | Missing |
recordedBy has 55563 (9.2%) missing values | Missing |
sex has 88216 (14.7%) missing values | Missing |
lifeStage has 550088 (91.5%) missing values | Missing |
preparations has 26965 (4.5%) missing values | Missing |
associatedSequences has 600397 (99.8%) missing values | Missing |
occurrenceRemarks has 590662 (98.2%) missing values | Missing |
eventDate has 28480 (4.7%) missing values | Missing |
startDayOfYear has 67487 (11.2%) missing values | Missing |
endDayOfYear has 67487 (11.2%) missing values | Missing |
year has 28519 (4.7%) missing values | Missing |
month has 45368 (7.5%) missing values | Missing |
day has 68254 (11.3%) missing values | Missing |
verbatimEventDate has 36490 (6.1%) missing values | Missing |
habitat has 468915 (78.0%) missing values | Missing |
continent has 39181 (6.5%) missing values | Missing |
waterBody has 539858 (89.8%) missing values | Missing |
islandGroup has 596682 (99.2%) missing values | Missing |
island has 564842 (93.9%) missing values | Missing |
stateProvince has 93954 (15.6%) missing values | Missing |
county has 447402 (74.4%) missing values | Missing |
locality has 35404 (5.9%) missing values | Missing |
verbatimElevation has 599861 (99.7%) missing values | Missing |
decimalLatitude has 447917 (74.5%) missing values | Missing |
decimalLongitude has 447917 (74.5%) missing values | Missing |
verbatimCoordinateSystem has 468202 (77.8%) missing values | Missing |
georeferenceProtocol has 592196 (98.5%) missing values | Missing |
georeferenceRemarks has 601383 (> 99.9%) missing values | Missing |
identificationQualifier has 599947 (99.7%) missing values | Missing |
typeStatus has 597715 (99.4%) missing values | Missing |
identifiedBy has 593267 (98.6%) missing values | Missing |
specificEpithet has 29657 (4.9%) missing values | Missing |
infraspecificEpithet has 386527 (64.3%) missing values | Missing |
elevation has 496901 (82.6%) missing values | Missing |
elevationAccuracy has 597572 (99.4%) missing values | Missing |
depth has 601448 (> 99.9%) missing values | Missing |
distanceFromCentroidInMeters has 601180 (> 99.9%) missing values | Missing |
mediaType has 45831 (7.6%) missing values | Missing |
speciesKey has 29663 (4.9%) missing values | Missing |
species has 29663 (4.9%) missing values | Missing |
gbifRegion has 15955 (2.7%) missing values | Missing |
level0Gid has 473902 (78.8%) missing values | Missing |
level0Name has 473902 (78.8%) missing values | Missing |
level1Gid has 473930 (78.8%) missing values | Missing |
level1Name has 473930 (78.8%) missing values | Missing |
level2Gid has 475037 (79.0%) missing values | Missing |
level2Name has 475037 (79.0%) missing values | Missing |
level3Gid has 539154 (89.6%) missing values | Missing |
level3Name has 539390 (89.7%) missing values | Missing |
iucnRedListCategory has 210302 (35.0%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
Reproduction
| Analysis started | 2025-01-08 22:53:29.700947 |
|---|---|
| Analysis finished | 2025-01-08 22:53:52.845809 |
| Duration | 23.14 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 601451 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 601451 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1322535732 |
|---|---|
| 2nd row | 1322538146 |
| 3rd row | 1317206206 |
| 4th row | 1317210025 |
| 5th row | 1317210456 |
| Value | Count | Frequency (%) |
| 1322535732 | 1 | < 0.1% |
| 1322555094 | 1 | < 0.1% |
| 1322560018 | 1 | < 0.1% |
| 1322558352 | 1 | < 0.1% |
| 1317224532 | 1 | < 0.1% |
| 4041103536 | 1 | < 0.1% |
| 1317206206 | 1 | < 0.1% |
| 1317210025 | 1 | < 0.1% |
| 1317210456 | 1 | < 0.1% |
| 1317211504 | 1 | < 0.1% |
| Other values (601441) | 601441 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1342473 | |
| 3 | 953825 | |
| 2 | 772027 | |
| 8 | 469400 | 7.8% |
| 9 | 463026 | 7.7% |
| 0 | 459240 | 7.6% |
| 7 | 444579 | 7.4% |
| 4 | 377786 | 6.3% |
| 5 | 367488 | 6.1% |
| 6 | 364666 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6014510 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1342473 | |
| 3 | 953825 | |
| 2 | 772027 | |
| 8 | 469400 | 7.8% |
| 9 | 463026 | 7.7% |
| 0 | 459240 | 7.6% |
| 7 | 444579 | 7.4% |
| 4 | 377786 | 6.3% |
| 5 | 367488 | 6.1% |
| 6 | 364666 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6014510 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1342473 | |
| 3 | 953825 | |
| 2 | 772027 | |
| 8 | 469400 | 7.8% |
| 9 | 463026 | 7.7% |
| 0 | 459240 | 7.6% |
| 7 | 444579 | 7.4% |
| 4 | 377786 | 6.3% |
| 5 | 367488 | 6.1% |
| 6 | 364666 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6014510 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1342473 | |
| 3 | 953825 | |
| 2 | 772027 | |
| 8 | 469400 | 7.8% |
| 9 | 463026 | 7.7% |
| 0 | 459240 | 7.6% |
| 7 | 444579 | 7.4% |
| 4 | 377786 | 6.3% |
| 5 | 367488 | 6.1% |
| 6 | 364666 | 6.1% |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1202902 | |
| 0 | 1202902 | |
| _ | 1202902 | |
| 1 | 601451 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1804353 | |
| Uppercase Letter | 1202902 | |
| Connector Punctuation | 1202902 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1202902 | |
| 1 | 601451 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1202902 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1202902 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3007255 | |
| Latin | 1202902 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1202902 | |
| _ | 1202902 | |
| 1 | 601451 |
Latin
| Value | Count | Frequency (%) |
| C | 1202902 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4210157 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 1202902 | |
| 0 | 1202902 | |
| _ | 1202902 | |
| 1 | 601451 |
modified
Text
| Distinct | 29672 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 12662 ? |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | 2021-08-09T14:50:00Z |
|---|---|
| 2nd row | 2020-04-09T11:54:00Z |
| 3rd row | 2020-03-17T10:16:00Z |
| 4th row | 2020-05-20T10:50:00Z |
| 5th row | 2017-12-08T15:28:00Z |
| Value | Count | Frequency (%) |
| 2021-01-11t15:15:00z | 2641 | 0.4% |
| 2023-02-10t10:31:00z | 2632 | 0.4% |
| 2021-08-09t14:46:00z | 2522 | 0.4% |
| 2020-07-20t15:30:00z | 2313 | 0.4% |
| 2017-12-08t15:27:00z | 2105 | 0.3% |
| 2021-08-09t14:49:00z | 2096 | 0.3% |
| 2017-12-08t15:33:00z | 2050 | 0.3% |
| 2017-12-08t15:36:00z | 2008 | 0.3% |
| 2020-07-24t16:11:00z | 1979 | 0.3% |
| 2017-12-08t15:35:00z | 1972 | 0.3% |
| Other values (29662) | 579133 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3217264 | |
| 2 | 1683114 | |
| 1 | 1412891 | |
| - | 1202902 | 10.0% |
| : | 1202902 | 10.0% |
| T | 601451 | 5.0% |
| Z | 601451 | 5.0% |
| 4 | 455860 | 3.8% |
| 3 | 439973 | 3.7% |
| 5 | 428273 | 3.6% |
| Other values (4) | 782939 | 6.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8420314 | |
| Dash Punctuation | 1202902 | 10.0% |
| Other Punctuation | 1202902 | 10.0% |
| Uppercase Letter | 1202902 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3217264 | |
| 2 | 1683114 | |
| 1 | 1412891 | |
| 4 | 455860 | 5.4% |
| 3 | 439973 | 5.2% |
| 5 | 428273 | 5.1% |
| 9 | 215795 | 2.6% |
| 6 | 207959 | 2.5% |
| 7 | 187569 | 2.2% |
| 8 | 171616 | 2.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 601451 | |
| Z | 601451 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1202902 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1202902 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10826118 | |
| Latin | 1202902 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3217264 | |
| 2 | 1683114 | |
| 1 | 1412891 | |
| - | 1202902 | 11.1% |
| : | 1202902 | 11.1% |
| 4 | 455860 | 4.2% |
| 3 | 439973 | 4.1% |
| 5 | 428273 | 4.0% |
| 9 | 215795 | 2.0% |
| 6 | 207959 | 1.9% |
| Other values (2) | 359185 | 3.3% |
Latin
| Value | Count | Frequency (%) |
| T | 601451 | |
| Z | 601451 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12029020 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3217264 | |
| 2 | 1683114 | |
| 1 | 1412891 | |
| - | 1202902 | 10.0% |
| : | 1202902 | 10.0% |
| T | 601451 | 5.0% |
| Z | 601451 | 5.0% |
| 4 | 455860 | 3.8% |
| 3 | 439973 | 3.7% |
| 5 | 428273 | 3.6% |
| Other values (4) | 782939 | 6.5% |
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 59 |
| Mean length | 59 |
| Min length | 59 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | National Museum of Natural History, Smithsonian Institution |
|---|---|
| 2nd row | National Museum of Natural History, Smithsonian Institution |
| 3rd row | National Museum of Natural History, Smithsonian Institution |
| 4th row | National Museum of Natural History, Smithsonian Institution |
| 5th row | National Museum of Natural History, Smithsonian Institution |
| Value | Count | Frequency (%) |
| national | 601451 | |
| museum | 601451 | |
| of | 601451 | |
| natural | 601451 | |
| history | 601451 | |
| smithsonian | 601451 | |
| institution | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 4210157 | |
| i | 3608706 | |
| 3608706 | ||
| a | 3007255 | 8.5% |
| o | 3007255 | 8.5% |
| n | 3007255 | 8.5% |
| s | 2405804 | 6.8% |
| u | 2405804 | 6.8% |
| r | 1202902 | 3.4% |
| m | 1202902 | 3.4% |
| Other values (11) | 7818863 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27666746 | |
| Space Separator | 3608706 | 10.2% |
| Uppercase Letter | 3608706 | 10.2% |
| Other Punctuation | 601451 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4210157 | |
| i | 3608706 | |
| a | 3007255 | |
| o | 3007255 | |
| n | 3007255 | |
| s | 2405804 | |
| u | 2405804 | |
| r | 1202902 | 4.3% |
| m | 1202902 | 4.3% |
| l | 1202902 | 4.3% |
| Other values (4) | 2405804 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1202902 | |
| M | 601451 | |
| H | 601451 | |
| S | 601451 | |
| I | 601451 |
Space Separator
| Value | Count | Frequency (%) |
| 3608706 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 601451 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31275452 | |
| Common | 4210157 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 4210157 | |
| i | 3608706 | |
| a | 3007255 | |
| o | 3007255 | |
| n | 3007255 | |
| s | 2405804 | 7.7% |
| u | 2405804 | 7.7% |
| r | 1202902 | 3.8% |
| m | 1202902 | 3.8% |
| N | 1202902 | 3.8% |
| Other values (9) | 6014510 |
Common
| Value | Count | Frequency (%) |
| 3608706 | ||
| , | 601451 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35485609 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 4210157 | |
| i | 3608706 | |
| 3608706 | ||
| a | 3007255 | 8.5% |
| o | 3007255 | 8.5% |
| n | 3007255 | 8.5% |
| s | 2405804 | 6.8% |
| u | 2405804 | 6.8% |
| r | 1202902 | 3.4% |
| m | 1202902 | 3.4% |
| Other values (11) | 7818863 |
institutionID
Text
| Distinct | 50 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 28.8108624 |
| Min length | 2 |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:34871 |
| 3rd row | urn:lsid:biocol.org:col:34871 |
| 4th row | urn:lsid:biocol.org:col:34871 |
| 5th row | urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:34871 | 596967 | |
| nsmt | 977 | 0.2% |
| uam | 775 | 0.1% |
| nrm | 386 | 0.1% |
| rmnh | 354 | 0.1% |
| rcs | 246 | < 0.1% |
| nmv | 238 | < 0.1% |
| nmsz | 188 | < 0.1% |
| zmmu | 179 | < 0.1% |
| fcmm | 127 | < 0.1% |
| Other values (40) | 1015 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2387868 | |
| : | 2387868 | |
| l | 1790901 | 10.3% |
| i | 1193934 | 6.9% |
| r | 1193934 | 6.9% |
| c | 1193934 | 6.9% |
| g | 596967 | 3.4% |
| 7 | 596967 | 3.4% |
| 8 | 596967 | 3.4% |
| 4 | 596967 | 3.4% |
| Other values (31) | 4792015 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11342373 | |
| Other Punctuation | 2984837 | 17.2% |
| Decimal Number | 2984835 | 17.2% |
| Uppercase Letter | 16276 | 0.1% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 4384 | |
| N | 2583 | |
| S | 1796 | |
| A | 1319 | 8.1% |
| U | 1175 | 7.2% |
| R | 1035 | 6.4% |
| T | 978 | 6.0% |
| C | 551 | 3.4% |
| H | 550 | 3.4% |
| Z | 467 | 2.9% |
| Other values (11) | 1438 | 8.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2387868 | |
| l | 1790901 | |
| i | 1193934 | |
| r | 1193934 | |
| c | 1193934 | |
| g | 596967 | 5.3% |
| u | 596967 | 5.3% |
| b | 596967 | 5.3% |
| d | 596967 | 5.3% |
| s | 596967 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 596967 | |
| 8 | 596967 | |
| 4 | 596967 | |
| 3 | 596967 | |
| 1 | 596967 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2387868 | |
| . | 596967 | 20.0% |
| ? | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11358649 | |
| Common | 5969673 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2387868 | |
| l | 1790901 | |
| i | 1193934 | |
| r | 1193934 | |
| c | 1193934 | |
| g | 596967 | 5.3% |
| u | 596967 | 5.3% |
| b | 596967 | 5.3% |
| d | 596967 | 5.3% |
| s | 596967 | 5.3% |
| Other values (22) | 613243 | 5.4% |
Common
| Value | Count | Frequency (%) |
| : | 2387868 | |
| 7 | 596967 | 10.0% |
| 8 | 596967 | 10.0% |
| 4 | 596967 | 10.0% |
| 3 | 596967 | 10.0% |
| . | 596967 | 10.0% |
| 1 | 596967 | 10.0% |
| ? | 2 | < 0.1% |
| 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17328322 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2387868 | |
| : | 2387868 | |
| l | 1790901 | 10.3% |
| i | 1193934 | 6.9% |
| r | 1193934 | 6.9% |
| c | 1193934 | 6.9% |
| g | 596967 | 3.4% |
| 7 | 596967 | 3.4% |
| 8 | 596967 | 3.4% |
| 4 | 596967 | 3.4% |
| Other values (31) | 4792015 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 |
|---|---|
| 2nd row | urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 |
| 3rd row | urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 |
| 4th row | urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 |
| 5th row | urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 |
| Value | Count | Frequency (%) |
| urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 3007255 | 11.1% |
| - | 2405804 | 8.9% |
| 5 | 2405804 | 8.9% |
| 6 | 1804353 | 6.7% |
| e | 1804353 | 6.7% |
| u | 1804353 | 6.7% |
| d | 1202902 | 4.4% |
| 9 | 1202902 | 4.4% |
| : | 1202902 | 4.4% |
| 1 | 1202902 | 4.4% |
| Other values (12) | 9021765 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13833373 | |
| Lowercase Letter | 9623216 | |
| Dash Punctuation | 2405804 | 8.9% |
| Other Punctuation | 1202902 | 4.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 3007255 | |
| 5 | 2405804 | |
| 6 | 1804353 | |
| 9 | 1202902 | 8.7% |
| 1 | 1202902 | 8.7% |
| 4 | 1202902 | 8.7% |
| 2 | 1202902 | 8.7% |
| 0 | 601451 | 4.3% |
| 3 | 601451 | 4.3% |
| 7 | 601451 | 4.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1804353 | |
| u | 1804353 | |
| d | 1202902 | |
| b | 1202902 | |
| i | 601451 | 6.2% |
| a | 601451 | 6.2% |
| r | 601451 | 6.2% |
| n | 601451 | 6.2% |
| c | 601451 | 6.2% |
| f | 601451 | 6.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2405804 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1202902 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17442079 | |
| Latin | 9623216 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 3007255 | |
| - | 2405804 | |
| 5 | 2405804 | |
| 6 | 1804353 | |
| 9 | 1202902 | 6.9% |
| : | 1202902 | 6.9% |
| 1 | 1202902 | 6.9% |
| 4 | 1202902 | 6.9% |
| 2 | 1202902 | 6.9% |
| 0 | 601451 | 3.4% |
| Other values (2) | 1202902 | 6.9% |
Latin
| Value | Count | Frequency (%) |
| e | 1804353 | |
| u | 1804353 | |
| d | 1202902 | |
| b | 1202902 | |
| i | 601451 | 6.2% |
| a | 601451 | 6.2% |
| r | 601451 | 6.2% |
| n | 601451 | 6.2% |
| c | 601451 | 6.2% |
| f | 601451 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27065295 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 3007255 | 11.1% |
| - | 2405804 | 8.9% |
| 5 | 2405804 | 8.9% |
| 6 | 1804353 | 6.7% |
| e | 1804353 | 6.7% |
| u | 1804353 | 6.7% |
| d | 1202902 | 4.4% |
| 9 | 1202902 | 4.4% |
| : | 1202902 | 4.4% |
| 1 | 1202902 | 4.4% |
| Other values (12) | 9021765 |
institutionCode
Text
| Distinct | 50 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 3.997244996 |
| Min length | 2 |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 596967 | |
| nsmt | 977 | 0.2% |
| uam | 775 | 0.1% |
| nrm | 386 | 0.1% |
| rmnh | 354 | 0.1% |
| rcs | 246 | < 0.1% |
| nmv | 238 | < 0.1% |
| nmsz | 188 | < 0.1% |
| zmmu | 179 | < 0.1% |
| fcmm | 127 | < 0.1% |
| Other values (40) | 1015 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 601351 | |
| N | 599550 | |
| S | 598763 | |
| U | 598142 | |
| A | 1319 | 0.1% |
| R | 1035 | < 0.1% |
| T | 978 | < 0.1% |
| C | 551 | < 0.1% |
| H | 550 | < 0.1% |
| Z | 467 | < 0.1% |
| Other values (13) | 1441 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2404144 | |
| Other Punctuation | 2 | < 0.1% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 601351 | |
| N | 599550 | |
| S | 598763 | |
| U | 598142 | |
| A | 1319 | 0.1% |
| R | 1035 | < 0.1% |
| T | 978 | < 0.1% |
| C | 551 | < 0.1% |
| H | 550 | < 0.1% |
| Z | 467 | < 0.1% |
| Other values (11) | 1438 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2404144 | |
| Common | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 601351 | |
| N | 599550 | |
| S | 598763 | |
| U | 598142 | |
| A | 1319 | 0.1% |
| R | 1035 | < 0.1% |
| T | 978 | < 0.1% |
| C | 551 | < 0.1% |
| H | 550 | < 0.1% |
| Z | 467 | < 0.1% |
| Other values (11) | 1438 | 0.1% |
Common
| Value | Count | Frequency (%) |
| ? | 2 | |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2404147 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 601351 | |
| N | 599550 | |
| S | 598763 | |
| U | 598142 | |
| A | 1319 | 0.1% |
| R | 1035 | < 0.1% |
| T | 978 | < 0.1% |
| C | 551 | < 0.1% |
| H | 550 | < 0.1% |
| Z | 467 | < 0.1% |
| Other values (13) | 1441 | 0.1% |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MAMM |
|---|---|
| 2nd row | MAMM |
| 3rd row | MAMM |
| 4th row | MAMM |
| 5th row | MAMM |
| Value | Count | Frequency (%) |
| mamm | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 1804353 | |
| A | 601451 | 25.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2405804 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1804353 | |
| A | 601451 | 25.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2405804 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 1804353 | |
| A | 601451 | 25.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2405804 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 1804353 | |
| A | 601451 | 25.0% |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 601451 | |
| extant | 601451 | |
| biology | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1202902 | 10.5% |
| 1202902 | 10.5% | |
| t | 1202902 | 10.5% |
| o | 1202902 | 10.5% |
| M | 601451 | 5.3% |
| H | 601451 | 5.3% |
| E | 601451 | 5.3% |
| x | 601451 | 5.3% |
| a | 601451 | 5.3% |
| n | 601451 | 5.3% |
| Other values (5) | 3007255 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6615961 | |
| Uppercase Letter | 3608706 | |
| Space Separator | 1202902 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1202902 | |
| o | 1202902 | |
| x | 601451 | |
| a | 601451 | |
| n | 601451 | |
| i | 601451 | |
| l | 601451 | |
| g | 601451 | |
| y | 601451 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1202902 | |
| M | 601451 | |
| H | 601451 | |
| E | 601451 | |
| B | 601451 |
Space Separator
| Value | Count | Frequency (%) |
| 1202902 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10224667 | |
| Common | 1202902 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1202902 | |
| t | 1202902 | |
| o | 1202902 | |
| M | 601451 | 5.9% |
| H | 601451 | 5.9% |
| E | 601451 | 5.9% |
| x | 601451 | 5.9% |
| a | 601451 | 5.9% |
| n | 601451 | 5.9% |
| B | 601451 | 5.9% |
| Other values (4) | 2405804 |
Common
| Value | Count | Frequency (%) |
| 1202902 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11427569 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1202902 | 10.5% |
| 1202902 | 10.5% | |
| t | 1202902 | 10.5% |
| o | 1202902 | 10.5% |
| M | 601451 | 5.3% |
| H | 601451 | 5.3% |
| E | 601451 | 5.3% |
| x | 601451 | 5.3% |
| a | 601451 | 5.3% |
| n | 601451 | 5.3% |
| Other values (5) | 3007255 |
basisOfRecord
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 17.95205428 |
| Min length | 17 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESERVED_SPECIMEN |
|---|---|
| 2nd row | PRESERVED_SPECIMEN |
| 3rd row | PRESERVED_SPECIMEN |
| 4th row | PRESERVED_SPECIMEN |
| 5th row | HUMAN_OBSERVATION |
| Value | Count | Frequency (%) |
| preserved_specimen | 572614 | |
| human_observation | 28837 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 2891907 | |
| R | 1174065 | |
| S | 1174065 | |
| P | 1145228 | 10.6% |
| N | 630288 | 5.8% |
| M | 601451 | 5.6% |
| I | 601451 | 5.6% |
| _ | 601451 | 5.6% |
| V | 601451 | 5.6% |
| C | 572614 | 5.3% |
| Other values (7) | 803310 | 7.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10195830 | |
| Connector Punctuation | 601451 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2891907 | |
| R | 1174065 | |
| S | 1174065 | |
| P | 1145228 | 11.2% |
| N | 630288 | 6.2% |
| M | 601451 | 5.9% |
| I | 601451 | 5.9% |
| V | 601451 | 5.9% |
| C | 572614 | 5.6% |
| D | 572614 | 5.6% |
| Other values (6) | 230696 | 2.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 601451 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10195830 | |
| Common | 601451 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 2891907 | |
| R | 1174065 | |
| S | 1174065 | |
| P | 1145228 | 11.2% |
| N | 630288 | 6.2% |
| M | 601451 | 5.9% |
| I | 601451 | 5.9% |
| V | 601451 | 5.9% |
| C | 572614 | 5.6% |
| D | 572614 | 5.6% |
| Other values (6) | 230696 | 2.3% |
Common
| Value | Count | Frequency (%) |
| _ | 601451 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10797281 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 2891907 | |
| R | 1174065 | |
| S | 1174065 | |
| P | 1145228 | 10.6% |
| N | 630288 | 5.8% |
| M | 601451 | 5.6% |
| I | 601451 | 5.6% |
| _ | 601451 | 5.6% |
| V | 601451 | 5.6% |
| C | 572614 | 5.3% |
| Other values (7) | 803310 | 7.4% |
occurrenceID
Text
Unique 
| Distinct | 601451 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 601451 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/3ebec6a7f-5e95-4543-b061-6d73d80dd2ee |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/3ec070d5d-1893-4600-afa5-e56695ff219b |
| 3rd row | http://n2t.net/ark:/65665/3002acaf9-9788-4539-8883-fe6bfd5f8d88 |
| 4th row | http://n2t.net/ark:/65665/300553499-1544-460e-9507-55ada241f992 |
| 5th row | http://n2t.net/ark:/65665/3005a3503-9c20-443c-899a-559e550dc71e |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/3ebec6a7f-5e95-4543-b061-6d73d80dd2ee | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ecc76d35-e5c5-434e-874b-88c5d85dbb91 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ecff6276-27d1-4ad7-aac3-32c485b9bed6 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3eceb4d85-2fbe-4bf2-aef7-b3393445f319 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300f96572-4f6d-48dc-9b78-1ba0e03bb0ae | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec5d68e1-4786-40d2-9bdb-bb8ef2ad056d | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3002acaf9-9788-4539-8883-fe6bfd5f8d88 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300553499-1544-460e-9507-55ada241f992 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3005a3503-9c20-443c-899a-559e550dc71e | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300664e6c-5334-4a8e-b9a7-4d84389595e0 | 1 | < 0.1% |
| Other values (601441) | 601441 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 3007255 | 7.9% |
| 6 | 2930823 | 7.7% |
| - | 2405804 | 6.3% |
| t | 2405804 | 6.3% |
| 5 | 2330760 | 6.2% |
| a | 1878835 | 5.0% |
| e | 1729856 | 4.6% |
| 2 | 1729289 | 4.6% |
| 3 | 1728046 | 4.6% |
| 4 | 1727823 | 4.6% |
| Other values (16) | 16017118 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16387822 | |
| Lowercase Letter | 14286179 | |
| Other Punctuation | 4811608 | 12.7% |
| Dash Punctuation | 2405804 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2405804 | |
| a | 1878835 | |
| e | 1729856 | |
| b | 1278851 | |
| n | 1202902 | |
| f | 1128774 | |
| c | 1128212 | |
| d | 1127141 | |
| k | 601451 | 4.2% |
| r | 601451 | 4.2% |
| Other values (2) | 1202902 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2930823 | |
| 5 | 2330760 | |
| 2 | 1729289 | |
| 3 | 1728046 | |
| 4 | 1727823 | |
| 9 | 1279292 | |
| 8 | 1278534 | |
| 0 | 1129193 | 6.9% |
| 7 | 1127612 | 6.9% |
| 1 | 1126450 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3007255 | |
| : | 1202902 | 25.0% |
| . | 601451 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2405804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23605234 | |
| Latin | 14286179 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 3007255 | |
| 6 | 2930823 | |
| - | 2405804 | |
| 5 | 2330760 | |
| 2 | 1729289 | |
| 3 | 1728046 | |
| 4 | 1727823 | |
| 9 | 1279292 | 5.4% |
| 8 | 1278534 | 5.4% |
| : | 1202902 | 5.1% |
| Other values (4) | 3984706 |
Latin
| Value | Count | Frequency (%) |
| t | 2405804 | |
| a | 1878835 | |
| e | 1729856 | |
| b | 1278851 | |
| n | 1202902 | |
| f | 1128774 | |
| c | 1128212 | |
| d | 1127141 | |
| k | 601451 | 4.2% |
| r | 601451 | 4.2% |
| Other values (2) | 1202902 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37891413 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 3007255 | 7.9% |
| 6 | 2930823 | 7.7% |
| - | 2405804 | 6.3% |
| t | 2405804 | 6.3% |
| 5 | 2330760 | 6.2% |
| a | 1878835 | 5.0% |
| e | 1729856 | 4.6% |
| 2 | 1729289 | 4.6% |
| 3 | 1728046 | 4.6% |
| 4 | 1727823 | 4.6% |
| Other values (16) | 16017118 |
catalogNumber
Text
| Distinct | 601428 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 10.92069179 |
| Min length | 4 |
Unique
| Unique | 601407 ? |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | USNM 449558 |
|---|---|
| 2nd row | USNM 226903 |
| 3rd row | USNM 386480 |
| 4th row | USNM 68620 |
| 5th row | USNM MME9342 |
| Value | Count | Frequency (%) |
| usnm | 596967 | |
| wam | 63 | < 0.1% |
| mb | 40 | < 0.1% |
| zin | 21 | < 0.1% |
| lacm | 18 | < 0.1% |
| nsmt | 12 | < 0.1% |
| sama | 6 | < 0.1% |
| zmmu | 5 | < 0.1% |
| rmnh | 4 | < 0.1% |
| ncsm | 4 | < 0.1% |
| Other values (601439) | 601471 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 627122 | |
| S | 616877 | 9.4% |
| N | 601401 | 9.2% |
| U | 598144 | 9.1% |
| 597160 | 9.1% | |
| 1 | 405808 | 6.2% |
| 2 | 403390 | 6.1% |
| 3 | 394478 | 6.0% |
| 5 | 393693 | 6.0% |
| 4 | 379861 | 5.8% |
| Other values (25) | 1550327 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3465081 | |
| Uppercase Letter | 2506018 | |
| Space Separator | 597160 | 9.1% |
| Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 627122 | |
| S | 616877 | |
| N | 601401 | |
| U | 598144 | |
| R | 17298 | 0.7% |
| T | 17251 | 0.7% |
| E | 14721 | 0.6% |
| A | 10176 | 0.4% |
| C | 553 | < 0.1% |
| H | 550 | < 0.1% |
| Other values (13) | 1925 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 405808 | |
| 2 | 403390 | |
| 3 | 394478 | |
| 5 | 393693 | |
| 4 | 379861 | |
| 6 | 309193 | |
| 7 | 297996 | |
| 0 | 295420 | |
| 8 | 295286 | |
| 9 | 289956 |
Space Separator
| Value | Count | Frequency (%) |
| 597160 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4062243 | |
| Latin | 2506018 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 627122 | |
| S | 616877 | |
| N | 601401 | |
| U | 598144 | |
| R | 17298 | 0.7% |
| T | 17251 | 0.7% |
| E | 14721 | 0.6% |
| A | 10176 | 0.4% |
| C | 553 | < 0.1% |
| H | 550 | < 0.1% |
| Other values (13) | 1925 | 0.1% |
Common
| Value | Count | Frequency (%) |
| 597160 | ||
| 1 | 405808 | |
| 2 | 403390 | |
| 3 | 394478 | |
| 5 | 393693 | |
| 4 | 379861 | |
| 6 | 309193 | |
| 7 | 297996 | |
| 0 | 295420 | |
| 8 | 295286 | |
| Other values (2) | 289958 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6568261 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 627122 | |
| S | 616877 | 9.4% |
| N | 601401 | 9.2% |
| U | 598144 | 9.1% |
| 597160 | 9.1% | |
| 1 | 405808 | 6.2% |
| 2 | 403390 | 6.1% |
| 3 | 394478 | 6.0% |
| 5 | 393693 | 6.0% |
| 4 | 379861 | 5.8% |
| Other values (25) | 1550327 |
recordNumber
Text
Missing 
| Distinct | 172937 |
|---|---|
| Distinct (%) | 31.4% |
| Missing | 50821 |
| Missing (%) | 8.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 28 |
| Mean length | 5.176632221 |
| Min length | 1 |
Unique
| Unique | 147848 ? |
|---|---|
| Unique (%) | 26.9% |
Sample
| 1st row | FMG 2371 |
|---|---|
| 2nd row | 142/19534X |
| 3rd row | 07960 |
| 4th row | 6459 |
| 5th row | B47586/R50468 |
| Value | Count | Frequency (%) |
| no | 47434 | 6.9% |
| number | 47222 | 6.9% |
| cohjr | 5988 | 0.9% |
| nzp | 3372 | 0.5% |
| psc | 2713 | 0.4% |
| jwk | 2021 | 0.3% |
| r | 1947 | 0.3% |
| fm | 1793 | 0.3% |
| jjg | 1781 | 0.3% |
| rem | 1569 | 0.2% |
| Other values (105383) | 570874 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 307242 | 10.8% |
| 2 | 246234 | 8.6% |
| 3 | 208467 | 7.3% |
| 4 | 190900 | 6.7% |
| 0 | 182605 | 6.4% |
| 5 | 181877 | 6.4% |
| 6 | 173588 | 6.1% |
| 7 | 165796 | 5.8% |
| 8 | 159989 | 5.6% |
| 9 | 153227 | 5.4% |
| Other values (69) | 880484 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1969925 | |
| Uppercase Letter | 409557 | 14.4% |
| Lowercase Letter | 285569 | 10.0% |
| Space Separator | 136084 | 4.8% |
| Other Punctuation | 26739 | 0.9% |
| Dash Punctuation | 20734 | 0.7% |
| Close Punctuation | 888 | < 0.1% |
| Open Punctuation | 886 | < 0.1% |
| Currency Symbol | 13 | < 0.1% |
| Math Symbol | 10 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 106292 | |
| R | 28947 | 7.1% |
| M | 24702 | 6.0% |
| J | 23837 | 5.8% |
| C | 21743 | 5.3% |
| H | 19696 | 4.8% |
| X | 17857 | 4.4% |
| B | 15635 | 3.8% |
| P | 15412 | 3.8% |
| E | 14048 | 3.4% |
| Other values (16) | 121388 |
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 47347 | |
| e | 47325 | |
| o | 47216 | |
| m | 47180 | |
| u | 47177 | |
| b | 47174 | |
| n | 1310 | 0.5% |
| a | 152 | 0.1% |
| p | 115 | < 0.1% |
| i | 108 | < 0.1% |
| Other values (13) | 465 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 307242 | |
| 2 | 246234 | |
| 3 | 208467 | |
| 4 | 190900 | |
| 0 | 182605 | |
| 5 | 181877 | |
| 6 | 173588 | |
| 7 | 165796 | |
| 8 | 159989 | |
| 9 | 153227 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 23475 | |
| . | 2050 | 7.7% |
| , | 626 | 2.3% |
| # | 248 | 0.9% |
| ? | 202 | 0.8% |
| & | 47 | 0.2% |
| ; | 44 | 0.2% |
| : | 22 | 0.1% |
| * | 21 | 0.1% |
| ' | 4 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 887 | |
| ] | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 885 | |
| [ | 1 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 6 | |
| + | 4 |
Space Separator
| Value | Count | Frequency (%) |
| 136084 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 20734 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 13 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2155283 | |
| Latin | 695126 | 24.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 106292 | |
| r | 47347 | 6.8% |
| e | 47325 | 6.8% |
| o | 47216 | 6.8% |
| m | 47180 | 6.8% |
| u | 47177 | 6.8% |
| b | 47174 | 6.8% |
| R | 28947 | 4.2% |
| M | 24702 | 3.6% |
| J | 23837 | 3.4% |
| Other values (39) | 227929 |
Common
| Value | Count | Frequency (%) |
| 1 | 307242 | |
| 2 | 246234 | |
| 3 | 208467 | |
| 4 | 190900 | |
| 0 | 182605 | |
| 5 | 181877 | |
| 6 | 173588 | |
| 7 | 165796 | |
| 8 | 159989 | |
| 9 | 153227 | |
| Other values (20) | 185358 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2850396 | |
| None | 13 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 307242 | 10.8% |
| 2 | 246234 | 8.6% |
| 3 | 208467 | 7.3% |
| 4 | 190900 | 6.7% |
| 0 | 182605 | 6.4% |
| 5 | 181877 | 6.4% |
| 6 | 173588 | 6.1% |
| 7 | 165796 | 5.8% |
| 8 | 159989 | 5.6% |
| 9 | 153227 | 5.4% |
| Other values (68) | 880471 |
None
| Value | Count | Frequency (%) |
| ¢ | 13 |
recordedBy
Text
Missing 
| Distinct | 17644 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 55563 |
| Missing (%) | 9.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 124 |
|---|---|
| Median length | 114 |
| Mean length | 11.92282483 |
| Min length | 1 |
Unique
| Unique | 9079 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | F. Greenwell |
|---|---|
| 2nd row | J. Silver |
| 3rd row | Smithsonian Venezuelan Project |
| 4th row | Nelson & E. Goldman |
| 5th row | W. Bowen & V. Thayer |
| Value | Count | Frequency (%) |
| j | 60783 | 4.7% |
| e | 54366 | 4.2% |
| c | 53496 | 4.2% |
| 50457 | 3.9% | |
| r | 49868 | 3.9% |
| a | 44074 | 3.4% |
| w | 37880 | 2.9% |
| h | 30720 | 2.4% |
| d | 24753 | 1.9% |
| m | 23831 | 1.9% |
| Other values (10447) | 856734 |
Most occurring characters
| Value | Count | Frequency (%) |
| 741074 | 11.4% | |
| e | 563544 | 8.7% |
| . | 539103 | 8.3% |
| n | 389678 | 6.0% |
| a | 341353 | 5.2% |
| o | 335107 | 5.1% |
| r | 327053 | 5.0% |
| l | 295446 | 4.5% |
| i | 245022 | 3.8% |
| s | 228632 | 3.5% |
| Other values (70) | 2502515 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3897970 | |
| Uppercase Letter | 1254996 | 19.3% |
| Space Separator | 741074 | 11.4% |
| Other Punctuation | 599060 | 9.2% |
| Close Punctuation | 5447 | 0.1% |
| Open Punctuation | 5376 | 0.1% |
| Dash Punctuation | 2452 | < 0.1% |
| Decimal Number | 2151 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 563544 | |
| n | 389678 | |
| a | 341353 | |
| o | 335107 | |
| r | 327053 | 8.4% |
| l | 295446 | 7.6% |
| i | 245022 | 6.3% |
| s | 228632 | 5.9% |
| t | 223935 | 5.7% |
| h | 116266 | 3.0% |
| Other values (18) | 831934 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 91216 | 7.3% |
| M | 88625 | 7.1% |
| C | 87417 | 7.0% |
| S | 86724 | 6.9% |
| H | 84189 | 6.7% |
| G | 82831 | 6.6% |
| J | 76177 | 6.1% |
| A | 70972 | 5.7% |
| E | 64988 | 5.2% |
| P | 62861 | 5.0% |
| Other values (16) | 458996 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 539103 | |
| & | 50656 | 8.5% |
| , | 8029 | 1.3% |
| ' | 1002 | 0.2% |
| / | 114 | < 0.1% |
| : | 78 | < 0.1% |
| ? | 29 | < 0.1% |
| " | 26 | < 0.1% |
| ; | 13 | < 0.1% |
| # | 10 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1561 | |
| 8 | 243 | 11.3% |
| 2 | 219 | 10.2% |
| 4 | 34 | 1.6% |
| 6 | 33 | 1.5% |
| 0 | 31 | 1.4% |
| 9 | 12 | 0.6% |
| 5 | 8 | 0.4% |
| 3 | 7 | 0.3% |
| 7 | 3 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5375 | |
| [ | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 741074 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5447 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2452 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5152966 | |
| Common | 1355561 | 20.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 563544 | 10.9% |
| n | 389678 | 7.6% |
| a | 341353 | 6.6% |
| o | 335107 | 6.5% |
| r | 327053 | 6.3% |
| l | 295446 | 5.7% |
| i | 245022 | 4.8% |
| s | 228632 | 4.4% |
| t | 223935 | 4.3% |
| h | 116266 | 2.3% |
| Other values (44) | 2086930 |
Common
| Value | Count | Frequency (%) |
| 741074 | ||
| . | 539103 | |
| & | 50656 | 3.7% |
| , | 8029 | 0.6% |
| ) | 5447 | 0.4% |
| ( | 5375 | 0.4% |
| - | 2452 | 0.2% |
| 1 | 1561 | 0.1% |
| ' | 1002 | 0.1% |
| 8 | 243 | < 0.1% |
| Other values (16) | 619 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6508521 | |
| None | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 741074 | 11.4% | |
| e | 563544 | 8.7% |
| . | 539103 | 8.3% |
| n | 389678 | 6.0% |
| a | 341353 | 5.2% |
| o | 335107 | 5.1% |
| r | 327053 | 5.0% |
| l | 295446 | 4.5% |
| i | 245022 | 3.8% |
| s | 228632 | 3.5% |
| Other values (68) | 2502509 |
None
| Value | Count | Frequency (%) |
| ç | 3 | |
| ā | 3 |
individualCount
Text
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 44 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 1 |
| Mean length | 1.000033255 |
| Min length | 1 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 601314 | |
| 2 | 45 | < 0.1% |
| 6 | 8 | < 0.1% |
| 3 | 8 | < 0.1% |
| 4 | 6 | < 0.1% |
| 7 | 5 | < 0.1% |
| 5 | 4 | < 0.1% |
| 271 | 2 | < 0.1% |
| 11 | 2 | < 0.1% |
| 20 | 2 | < 0.1% |
| Other values (11) | 11 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 601326 | |
| 2 | 51 | < 0.1% |
| 6 | 9 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 9 | < 0.1% |
| 7 | 8 | < 0.1% |
| 0 | 7 | < 0.1% |
| 5 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 601427 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 601326 | |
| 2 | 51 | < 0.1% |
| 6 | 9 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 9 | < 0.1% |
| 7 | 8 | < 0.1% |
| 0 | 7 | < 0.1% |
| 5 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 601427 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 601326 | |
| 2 | 51 | < 0.1% |
| 6 | 9 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 9 | < 0.1% |
| 7 | 8 | < 0.1% |
| 0 | 7 | < 0.1% |
| 5 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 601427 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 601326 | |
| 2 | 51 | < 0.1% |
| 6 | 9 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 9 | < 0.1% |
| 7 | 8 | < 0.1% |
| 0 | 7 | < 0.1% |
| 5 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
sex
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 88216 |
| Missing (%) | 14.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.961610179 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MALE |
|---|---|
| 2nd row | MALE |
| 3rd row | MALE |
| 4th row | FEMALE |
| 5th row | FEMALE |
| Value | Count | Frequency (%) |
| male | 266469 | |
| female | 246766 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 760001 | |
| M | 513235 | |
| A | 513235 | |
| L | 513235 | |
| F | 246766 | 9.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2546472 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 760001 | |
| M | 513235 | |
| A | 513235 | |
| L | 513235 | |
| F | 246766 | 9.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2546472 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 760001 | |
| M | 513235 | |
| A | 513235 | |
| L | 513235 | |
| F | 246766 | 9.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2546472 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 760001 | |
| M | 513235 | |
| A | 513235 | |
| L | 513235 | |
| F | 246766 | 9.7% |
lifeStage
Text
Missing 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 550088 |
| Missing (%) | 91.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 6.093024161 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Adult |
|---|---|
| 2nd row | Adult |
| 3rd row | Juvenile |
| 4th row | Juvenile |
| 5th row | Adult |
| Value | Count | Frequency (%) |
| adult | 31097 | |
| juvenile | 11486 | 22.4% |
| immature | 3896 | 7.6% |
| subadult | 2153 | 4.2% |
| embryo | 983 | 1.9% |
| fetus | 681 | 1.3% |
| nestling | 499 | 1.0% |
| neonate | 448 | 0.9% |
| mature | 80 | 0.2% |
| unknown | 40 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 51546 | |
| l | 45235 | |
| t | 38854 | |
| d | 33250 | |
| A | 31097 | |
| e | 29024 | |
| n | 12553 | 4.0% |
| i | 11985 | 3.8% |
| J | 11486 | 3.7% |
| v | 11486 | 3.7% |
| Other values (17) | 36440 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 261593 | |
| Uppercase Letter | 51363 | 16.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 51546 | |
| l | 45235 | |
| t | 38854 | |
| d | 33250 | |
| e | 29024 | |
| n | 12553 | 4.8% |
| i | 11985 | 4.6% |
| v | 11486 | 4.4% |
| m | 8775 | 3.4% |
| a | 6577 | 2.5% |
| Other values (8) | 12308 | 4.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 31097 | |
| J | 11486 | 22.4% |
| I | 3896 | 7.6% |
| S | 2153 | 4.2% |
| E | 983 | 1.9% |
| N | 947 | 1.8% |
| F | 681 | 1.3% |
| M | 80 | 0.2% |
| U | 40 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 312956 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 51546 | |
| l | 45235 | |
| t | 38854 | |
| d | 33250 | |
| A | 31097 | |
| e | 29024 | |
| n | 12553 | 4.0% |
| i | 11985 | 3.8% |
| J | 11486 | 3.7% |
| v | 11486 | 3.7% |
| Other values (17) | 36440 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 312956 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 51546 | |
| l | 45235 | |
| t | 38854 | |
| d | 33250 | |
| A | 31097 | |
| e | 29024 | |
| n | 12553 | 4.0% |
| i | 11985 | 3.8% |
| J | 11486 | 3.7% |
| v | 11486 | 3.7% |
| Other values (17) | 36440 |
occurrenceStatus
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1202902 | |
| P | 601451 | |
| R | 601451 | |
| S | 601451 | |
| N | 601451 | |
| T | 601451 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4210157 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1202902 | |
| P | 601451 | |
| R | 601451 | |
| S | 601451 | |
| N | 601451 | |
| T | 601451 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4210157 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1202902 | |
| P | 601451 | |
| R | 601451 | |
| S | 601451 | |
| N | 601451 | |
| T | 601451 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4210157 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1202902 | |
| P | 601451 | |
| R | 601451 | |
| S | 601451 | |
| N | 601451 | |
| T | 601451 |
preparations
Text
Missing 
| Distinct | 542 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 26965 |
| Missing (%) | 4.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 73 |
|---|---|
| Median length | 11 |
| Mean length | 10.02423558 |
| Min length | 4 |
Unique
| Unique | 248 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Skin; Skull |
|---|---|
| 2nd row | Skin; Skull |
| 3rd row | Skin; Skull |
| 4th row | Skin; Skull |
| 5th row | Skin; Skull |
| Value | Count | Frequency (%) |
| skull | 452764 | |
| skin | 367609 | |
| fluid | 101452 | 10.0% |
| skeleton | 36584 | 3.6% |
| partial | 10316 | 1.0% |
| in | 8642 | 0.9% |
| remainder | 8641 | 0.9% |
| anatomical | 5878 | 0.6% |
| baculum/baubellum | 3372 | 0.3% |
| baleen | 2349 | 0.2% |
| Other values (42) | 14726 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1076304 | |
| k | 859539 | |
| S | 856659 | |
| u | 570461 | |
| i | 506031 | |
| 437847 | ||
| n | 435543 | |
| ; | 404417 | 7.0% |
| d | 111124 | 1.9% |
| e | 103346 | 1.8% |
| Other values (39) | 397512 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3909067 | |
| Uppercase Letter | 1004072 | 17.4% |
| Space Separator | 437847 | 7.6% |
| Other Punctuation | 407794 | 7.1% |
| Decimal Number | 2 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1076304 | |
| k | 859539 | |
| u | 570461 | |
| i | 506031 | |
| n | 435543 | |
| d | 111124 | 2.8% |
| e | 103346 | 2.6% |
| t | 60548 | 1.5% |
| o | 55332 | 1.4% |
| a | 53911 | 1.4% |
| Other values (15) | 76928 | 2.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 856659 | |
| F | 101451 | 10.1% |
| P | 11688 | 1.2% |
| B | 9093 | 0.9% |
| R | 8650 | 0.9% |
| A | 6797 | 0.7% |
| T | 3295 | 0.3% |
| H | 2684 | 0.3% |
| O | 1310 | 0.1% |
| M | 940 | 0.1% |
| Other values (6) | 1505 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 404417 | |
| / | 3372 | 0.8% |
| , | 4 | < 0.1% |
| . | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 | |
| 6 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 437847 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4913139 | |
| Common | 845644 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 1076304 | |
| k | 859539 | |
| S | 856659 | |
| u | 570461 | |
| i | 506031 | |
| n | 435543 | |
| d | 111124 | 2.3% |
| e | 103346 | 2.1% |
| F | 101451 | 2.1% |
| t | 60548 | 1.2% |
| Other values (31) | 232133 | 4.7% |
Common
| Value | Count | Frequency (%) |
| 437847 | ||
| ; | 404417 | |
| / | 3372 | 0.4% |
| , | 4 | < 0.1% |
| 5 | 1 | < 0.1% |
| . | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| + | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5758783 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 1076304 | |
| k | 859539 | |
| S | 856659 | |
| u | 570461 | |
| i | 506031 | |
| 437847 | ||
| n | 435543 | |
| ; | 404417 | 7.0% |
| d | 111124 | 1.9% |
| e | 103346 | 1.8% |
| Other values (39) | 397512 | 6.9% |
Missing 
| Distinct | 1050 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 600397 |
| Missing (%) | 99.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 699 |
|---|---|
| Median length | 49 |
| Mean length | 99.59108159 |
| Min length | 47 |
Unique
| Unique | 1046 ? |
|---|---|
| Unique (%) | 99.2% |
Sample
| 1st row | https://www.ncbi.nlm.nih.gov/gquery?term=AY922964;https://www.ncbi.nlm.nih.gov/gquery?term=AY922875 |
|---|---|
| 2nd row | https://www.ncbi.nlm.nih.gov/gquery?term=KC753815;https://www.ncbi.nlm.nih.gov/gquery?term=KC753933;https://www.ncbi.nlm.nih.gov/gquery?term=KC754042;https://www.ncbi.nlm.nih.gov/gquery?term=KC754162;https://www.ncbi.nlm.nih.gov/gquery?term=KC754280 |
| 3rd row | https://www.ncbi.nlm.nih.gov/gquery?term=KC011508;https://www.ncbi.nlm.nih.gov/gquery?term=KC011594;https://www.ncbi.nlm.nih.gov/gquery?term=KC011682 |
| 4th row | https://www.ncbi.nlm.nih.gov/gquery?term=MN707485;https://www.ncbi.nlm.nih.gov/gquery?term=MN707432 |
| 5th row | https://www.ncbi.nlm.nih.gov/gquery?term=JQ317640;https://www.ncbi.nlm.nih.gov/gquery?term=JQ317668 |
| Value | Count | Frequency (%) |
| https://www.ncbi.nlm.nih.gov/gquery?term=eu021073 | 2 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj383131 | 2 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kx998919 | 2 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=eu021074 | 2 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=dq178333;https://www.ncbi.nlm.nih.gov/gquery?term=dq178344 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay974630;https://www.ncbi.nlm.nih.gov/gquery?term=ay974676 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kc753815;https://www.ncbi.nlm.nih.gov/gquery?term=kc753933;https://www.ncbi.nlm.nih.gov/gquery?term=kc754042;https://www.ncbi.nlm.nih.gov/gquery?term=kc754162;https://www.ncbi.nlm.nih.gov/gquery?term=kc754280 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kc011508;https://www.ncbi.nlm.nih.gov/gquery?term=kc011594;https://www.ncbi.nlm.nih.gov/gquery?term=kc011682 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mn707485;https://www.ncbi.nlm.nih.gov/gquery?term=mn707432 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jq317640;https://www.ncbi.nlm.nih.gov/gquery?term=jq317668 | 1 | 0.1% |
| Other values (1040) | 1040 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 8515 | 8.1% |
| / | 6360 | 6.1% |
| w | 6360 | 6.1% |
| n | 6360 | 6.1% |
| t | 6360 | 6.1% |
| h | 4240 | 4.0% |
| r | 4240 | 4.0% |
| e | 4240 | 4.0% |
| i | 4240 | 4.0% |
| m | 4240 | 4.0% |
| Other values (48) | 49814 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 65720 | |
| Other Punctuation | 20181 | 19.2% |
| Decimal Number | 12730 | 12.1% |
| Uppercase Letter | 4213 | 4.0% |
| Math Symbol | 2120 | 2.0% |
| Connector Punctuation | 5 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 814 | |
| M | 721 | |
| N | 422 | |
| Y | 404 | |
| A | 392 | |
| T | 258 | 6.1% |
| F | 237 | 5.6% |
| J | 212 | 5.0% |
| C | 171 | 4.1% |
| Q | 146 | 3.5% |
| Other values (12) | 436 |
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 6360 | 9.7% |
| n | 6360 | 9.7% |
| t | 6360 | 9.7% |
| h | 4240 | 6.5% |
| r | 4240 | 6.5% |
| e | 4240 | 6.5% |
| i | 4240 | 6.5% |
| m | 4240 | 6.5% |
| g | 4240 | 6.5% |
| v | 2120 | 3.2% |
| Other values (9) | 19080 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 1517 | |
| 3 | 1452 | |
| 6 | 1407 | |
| 9 | 1389 | |
| 2 | 1352 | |
| 4 | 1216 | |
| 8 | 1213 | |
| 1 | 1128 | |
| 5 | 1094 | |
| 0 | 962 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8515 | |
| / | 6360 | |
| ? | 2120 | 10.5% |
| : | 2120 | 10.5% |
| ; | 1066 | 5.3% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 2120 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 69933 | |
| Common | 35036 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| w | 6360 | 9.1% |
| n | 6360 | 9.1% |
| t | 6360 | 9.1% |
| h | 4240 | 6.1% |
| r | 4240 | 6.1% |
| e | 4240 | 6.1% |
| i | 4240 | 6.1% |
| m | 4240 | 6.1% |
| g | 4240 | 6.1% |
| v | 2120 | 3.0% |
| Other values (31) | 23293 |
Common
| Value | Count | Frequency (%) |
| . | 8515 | |
| / | 6360 | |
| ? | 2120 | 6.1% |
| : | 2120 | 6.1% |
| = | 2120 | 6.1% |
| 7 | 1517 | 4.3% |
| 3 | 1452 | 4.1% |
| 6 | 1407 | 4.0% |
| 9 | 1389 | 4.0% |
| 2 | 1352 | 3.9% |
| Other values (7) | 6684 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104969 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 8515 | 8.1% |
| / | 6360 | 6.1% |
| w | 6360 | 6.1% |
| n | 6360 | 6.1% |
| t | 6360 | 6.1% |
| h | 4240 | 4.0% |
| r | 4240 | 4.0% |
| e | 4240 | 4.0% |
| i | 4240 | 4.0% |
| m | 4240 | 4.0% |
| Other values (48) | 49814 |
Missing 
| Distinct | 5322 |
|---|---|
| Distinct (%) | 49.3% |
| Missing | 590662 |
| Missing (%) | 98.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 44804 |
|---|---|
| Median length | 2082 |
| Mean length | 214.0076003 |
| Min length | 4 |
Unique
| Unique | 4721 ? |
|---|---|
| Unique (%) | 43.8% |
Sample
| 1st row | From ledger catalogue 577876-577900: "field data recorded from field catalogues" |
|---|---|
| 2nd row | Skin found in rotunda hallway hold-up case, 2017. May need tanning before installation into collection. |
| 3rd row | Lectotype designated by Avila Pires (1968:163). |
| 4th row | Skull removed from alcoholic specimen. |
| 5th row | More than 800 dolphins stranded along a 220 km stretch pof the coast of Peru. See STR18239.; Broccetto, Marilia CNN website 22 IV 2012 |
| Value | Count | Frequency (%) |
| the | 13880 | 3.8% |
| of | 9359 | 2.6% |
| and | 7684 | 2.1% |
| in | 7077 | 1.9% |
| for | 6435 | 1.8% |
| to | 6041 | 1.6% |
| 4896 | 1.3% | |
| on | 4761 | 1.3% |
| was | 4231 | 1.2% |
| from | 3875 | 1.1% |
| Other values (19019) | 298259 |
Most occurring characters
| Value | Count | Frequency (%) |
| 355709 | ||
| e | 205843 | 8.9% |
| a | 147185 | 6.4% |
| t | 125245 | 5.4% |
| o | 122482 | 5.3% |
| n | 120296 | 5.2% |
| i | 111994 | 4.9% |
| s | 111800 | 4.8% |
| r | 110930 | 4.8% |
| l | 77896 | 3.4% |
| Other values (148) | 819548 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1587531 | |
| Space Separator | 355709 | 15.4% |
| Uppercase Letter | 132353 | 5.7% |
| Decimal Number | 122350 | 5.3% |
| Other Punctuation | 87540 | 3.8% |
| Dash Punctuation | 8132 | 0.4% |
| Close Punctuation | 6920 | 0.3% |
| Open Punctuation | 6894 | 0.3% |
| Math Symbol | 680 | < 0.1% |
| Connector Punctuation | 461 | < 0.1% |
| Other values (8) | 358 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 205843 | |
| a | 147185 | 9.3% |
| t | 125245 | 7.9% |
| o | 122482 | 7.7% |
| n | 120296 | 7.6% |
| i | 111994 | 7.1% |
| s | 111800 | 7.0% |
| r | 110930 | 7.0% |
| l | 77896 | 4.9% |
| d | 65194 | 4.1% |
| Other values (53) | 388666 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 13793 | 10.4% |
| M | 11265 | 8.5% |
| N | 10762 | 8.1% |
| T | 10560 | 8.0% |
| C | 8190 | 6.2% |
| F | 7728 | 5.8% |
| I | 7523 | 5.7% |
| A | 7439 | 5.6% |
| B | 6332 | 4.8% |
| R | 5318 | 4.0% |
| Other values (18) | 43443 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 36734 | |
| , | 26137 | |
| : | 6493 | 7.4% |
| " | 5631 | 6.4% |
| ; | 4846 | 5.5% |
| / | 3229 | 3.7% |
| ' | 1865 | 2.1% |
| # | 977 | 1.1% |
| & | 535 | 0.6% |
| ? | 299 | 0.3% |
| Other values (12) | 794 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 20642 | |
| 0 | 20306 | |
| 2 | 20036 | |
| 5 | 10174 | |
| 9 | 10101 | |
| 7 | 9447 | |
| 6 | 8256 | 6.7% |
| 3 | 8246 | 6.7% |
| 4 | 7859 | 6.4% |
| 8 | 7283 | 6.0% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 207 | |
| + | 203 | |
| ~ | 120 | |
| < | 79 | 11.6% |
| > | 62 | 9.1% |
| | | 4 | 0.6% |
| ± | 2 | 0.3% |
| ¬ | 2 | 0.3% |
| − | 1 | 0.1% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 29 | |
| ¼ | 7 | 15.9% |
| ¹ | 5 | 11.4% |
| ¾ | 3 | 6.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7459 | |
| – | 656 | 8.1% |
| — | 17 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6315 | |
| ] | 602 | 8.7% |
| } | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6291 | |
| [ | 600 | 8.7% |
| { | 3 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 90 | |
| » | 1 | 1.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 48 | |
| ¥ | 10 | 17.2% |
Format
| Value | Count | Frequency (%) |
| | 3 | |
| | 2 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 1 | |
| ^ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 355709 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 461 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 83 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 67 |
Other Letter
| Value | Count | Frequency (%) |
| º | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1719816 | |
| Common | 589036 | 25.5% |
| Greek | 76 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 205843 | 12.0% |
| a | 147185 | 8.6% |
| t | 125245 | 7.3% |
| o | 122482 | 7.1% |
| n | 120296 | 7.0% |
| i | 111994 | 6.5% |
| s | 111800 | 6.5% |
| r | 110930 | 6.5% |
| l | 77896 | 4.5% |
| d | 65194 | 3.8% |
| Other values (70) | 520951 |
Common
| Value | Count | Frequency (%) |
| 355709 | ||
| . | 36734 | 6.2% |
| , | 26137 | 4.4% |
| 1 | 20642 | 3.5% |
| 0 | 20306 | 3.4% |
| 2 | 20036 | 3.4% |
| 5 | 10174 | 1.7% |
| 9 | 10101 | 1.7% |
| 7 | 9447 | 1.6% |
| 6 | 8256 | 1.4% |
| Other values (56) | 71494 | 12.1% |
Greek
| Value | Count | Frequency (%) |
| μ | 64 | |
| ο | 2 | 2.6% |
| ή | 1 | 1.3% |
| ϊ | 1 | 1.3% |
| ι | 1 | 1.3% |
| ν | 1 | 1.3% |
| ρ | 1 | 1.3% |
| υ | 1 | 1.3% |
| δ | 1 | 1.3% |
| α | 1 | 1.3% |
| Other values (2) | 2 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2307432 | |
| Punctuation | 858 | < 0.1% |
| None | 637 | < 0.1% |
| Math Operators | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 355709 | ||
| e | 205843 | 8.9% |
| a | 147185 | 6.4% |
| t | 125245 | 5.4% |
| o | 122482 | 5.3% |
| n | 120296 | 5.2% |
| i | 111994 | 4.9% |
| s | 111800 | 4.8% |
| r | 110930 | 4.8% |
| l | 77896 | 3.4% |
| Other values (84) | 818052 |
Punctuation
| Value | Count | Frequency (%) |
| – | 656 | |
| ” | 90 | 10.5% |
| “ | 83 | 9.7% |
| — | 17 | 2.0% |
| • | 4 | 0.5% |
| | 3 | 0.3% |
| … | 2 | 0.2% |
| ″ | 2 | 0.2% |
| ′ | 1 | 0.1% |
None
| Value | Count | Frequency (%) |
| · | 170 | |
| é | 78 | |
| ° | 67 | 10.5% |
| μ | 64 | 10.0% |
| ì | 58 | 9.1% |
| ½ | 29 | 4.6% |
| è | 20 | 3.1% |
| Ö | 12 | 1.9% |
| ä | 10 | 1.6% |
| ü | 10 | 1.6% |
| Other values (44) | 119 |
Math Operators
| Value | Count | Frequency (%) |
| − | 1 |
eventDate
Text
Missing 
| Distinct | 46549 |
|---|---|
| Distinct (%) | 8.1% |
| Missing | 28480 |
| Missing (%) | 4.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 9.72325999 |
| Min length | 4 |
Unique
| Unique | 7620 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | 1989-02-28 |
|---|---|
| 2nd row | 1917-08-08 |
| 3rd row | 1966-05 |
| 4th row | 1894-07-15 |
| 5th row | 1992-11-05 |
| Value | Count | Frequency (%) |
| 1968 | 1161 | 0.2% |
| 1959 | 829 | 0.1% |
| 1965-06 | 704 | 0.1% |
| 1966-06-02 | 682 | 0.1% |
| 1903 | 600 | 0.1% |
| 1905 | 591 | 0.1% |
| 1965 | 543 | 0.1% |
| 1967-08 | 537 | 0.1% |
| 1967-05 | 529 | 0.1% |
| 1968-09-02 | 520 | 0.1% |
| Other values (46539) | 566275 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 1091809 | |
| 1 | 1091166 | |
| 0 | 832958 | |
| 9 | 716794 | |
| 2 | 391838 | 7.0% |
| 6 | 323354 | 5.8% |
| 8 | 308610 | 5.5% |
| 7 | 251407 | 4.5% |
| 3 | 195450 | 3.5% |
| 5 | 191688 | 3.4% |
| Other values (2) | 176072 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4478570 | |
| Dash Punctuation | 1091809 | 19.6% |
| Other Punctuation | 767 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1091166 | |
| 0 | 832958 | |
| 9 | 716794 | |
| 2 | 391838 | 8.7% |
| 6 | 323354 | 7.2% |
| 8 | 308610 | 6.9% |
| 7 | 251407 | 5.6% |
| 3 | 195450 | 4.4% |
| 5 | 191688 | 4.3% |
| 4 | 175305 | 3.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1091809 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 767 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5571146 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 1091809 | |
| 1 | 1091166 | |
| 0 | 832958 | |
| 9 | 716794 | |
| 2 | 391838 | 7.0% |
| 6 | 323354 | 5.8% |
| 8 | 308610 | 5.5% |
| 7 | 251407 | 4.5% |
| 3 | 195450 | 3.5% |
| 5 | 191688 | 3.4% |
| Other values (2) | 176072 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5571146 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 1091809 | |
| 1 | 1091166 | |
| 0 | 832958 | |
| 9 | 716794 | |
| 2 | 391838 | 7.0% |
| 6 | 323354 | 5.8% |
| 8 | 308610 | 5.5% |
| 7 | 251407 | 4.5% |
| 3 | 195450 | 3.5% |
| 5 | 191688 | 3.4% |
| Other values (2) | 176072 | 3.2% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 67487 |
| Missing (%) | 11.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.721050483 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 59 |
|---|---|
| 2nd row | 220 |
| 3rd row | 196 |
| 4th row | 310 |
| 5th row | 77 |
| Value | Count | Frequency (%) |
| 193 | 2428 | 0.5% |
| 222 | 2369 | 0.4% |
| 199 | 2342 | 0.4% |
| 205 | 2305 | 0.4% |
| 207 | 2235 | 0.4% |
| 208 | 2179 | 0.4% |
| 197 | 2151 | 0.4% |
| 202 | 2126 | 0.4% |
| 203 | 2117 | 0.4% |
| 201 | 2091 | 0.4% |
| Other values (356) | 511621 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 275979 | |
| 2 | 269556 | |
| 3 | 182055 | |
| 5 | 109078 | 7.5% |
| 4 | 108546 | 7.5% |
| 6 | 105788 | 7.3% |
| 7 | 102563 | 7.1% |
| 9 | 100514 | 6.9% |
| 0 | 99672 | 6.9% |
| 8 | 99192 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1452943 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 275979 | |
| 2 | 269556 | |
| 3 | 182055 | |
| 5 | 109078 | 7.5% |
| 4 | 108546 | 7.5% |
| 6 | 105788 | 7.3% |
| 7 | 102563 | 7.1% |
| 9 | 100514 | 6.9% |
| 0 | 99672 | 6.9% |
| 8 | 99192 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1452943 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 275979 | |
| 2 | 269556 | |
| 3 | 182055 | |
| 5 | 109078 | 7.5% |
| 4 | 108546 | 7.5% |
| 6 | 105788 | 7.3% |
| 7 | 102563 | 7.1% |
| 9 | 100514 | 6.9% |
| 0 | 99672 | 6.9% |
| 8 | 99192 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1452943 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 275979 | |
| 2 | 269556 | |
| 3 | 182055 | |
| 5 | 109078 | 7.5% |
| 4 | 108546 | 7.5% |
| 6 | 105788 | 7.3% |
| 7 | 102563 | 7.1% |
| 9 | 100514 | 6.9% |
| 0 | 99672 | 6.9% |
| 8 | 99192 | 6.8% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 67487 |
| Missing (%) | 11.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.721117903 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 59 |
|---|---|
| 2nd row | 220 |
| 3rd row | 196 |
| 4th row | 310 |
| 5th row | 77 |
| Value | Count | Frequency (%) |
| 222 | 2369 | 0.4% |
| 193 | 2355 | 0.4% |
| 199 | 2343 | 0.4% |
| 205 | 2304 | 0.4% |
| 207 | 2253 | 0.4% |
| 208 | 2179 | 0.4% |
| 197 | 2150 | 0.4% |
| 204 | 2149 | 0.4% |
| 202 | 2125 | 0.4% |
| 203 | 2117 | 0.4% |
| Other values (356) | 511620 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 275595 | |
| 2 | 269744 | |
| 3 | 182061 | |
| 5 | 109081 | 7.5% |
| 4 | 108843 | 7.5% |
| 6 | 105700 | 7.3% |
| 7 | 102579 | 7.1% |
| 9 | 100408 | 6.9% |
| 0 | 99751 | 6.9% |
| 8 | 99217 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1452979 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 275595 | |
| 2 | 269744 | |
| 3 | 182061 | |
| 5 | 109081 | 7.5% |
| 4 | 108843 | 7.5% |
| 6 | 105700 | 7.3% |
| 7 | 102579 | 7.1% |
| 9 | 100408 | 6.9% |
| 0 | 99751 | 6.9% |
| 8 | 99217 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1452979 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 275595 | |
| 2 | 269744 | |
| 3 | 182061 | |
| 5 | 109081 | 7.5% |
| 4 | 108843 | 7.5% |
| 6 | 105700 | 7.3% |
| 7 | 102579 | 7.1% |
| 9 | 100408 | 6.9% |
| 0 | 99751 | 6.9% |
| 8 | 99217 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1452979 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 275595 | |
| 2 | 269744 | |
| 3 | 182061 | |
| 5 | 109081 | 7.5% |
| 4 | 108843 | 7.5% |
| 6 | 105700 | 7.3% |
| 7 | 102579 | 7.1% |
| 9 | 100408 | 6.9% |
| 0 | 99751 | 6.9% |
| 8 | 99217 | 6.8% |
year
Text
Missing 
| Distinct | 350 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 28519 |
| Missing (%) | 4.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 75 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1989 |
|---|---|
| 2nd row | 1917 |
| 3rd row | 1966 |
| 4th row | 1894 |
| 5th row | 1992 |
| Value | Count | Frequency (%) |
| 1967 | 30814 | 5.4% |
| 1968 | 27037 | 4.7% |
| 1966 | 22575 | 3.9% |
| 1969 | 15259 | 2.7% |
| 1965 | 12690 | 2.2% |
| 1964 | 12541 | 2.2% |
| 1962 | 11208 | 2.0% |
| 1970 | 10525 | 1.8% |
| 1916 | 9955 | 1.7% |
| 1963 | 9798 | 1.7% |
| Other values (340) | 410530 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 669600 | |
| 9 | 621357 | |
| 6 | 214928 | 9.4% |
| 8 | 199542 | 8.7% |
| 7 | 134576 | 5.9% |
| 0 | 132983 | 5.8% |
| 5 | 87279 | 3.8% |
| 2 | 86813 | 3.8% |
| 4 | 76096 | 3.3% |
| 3 | 68554 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2291728 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 669600 | |
| 9 | 621357 | |
| 6 | 214928 | 9.4% |
| 8 | 199542 | 8.7% |
| 7 | 134576 | 5.9% |
| 0 | 132983 | 5.8% |
| 5 | 87279 | 3.8% |
| 2 | 86813 | 3.8% |
| 4 | 76096 | 3.3% |
| 3 | 68554 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2291728 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 669600 | |
| 9 | 621357 | |
| 6 | 214928 | 9.4% |
| 8 | 199542 | 8.7% |
| 7 | 134576 | 5.9% |
| 0 | 132983 | 5.8% |
| 5 | 87279 | 3.8% |
| 2 | 86813 | 3.8% |
| 4 | 76096 | 3.3% |
| 3 | 68554 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2291728 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 669600 | |
| 9 | 621357 | |
| 6 | 214928 | 9.4% |
| 8 | 199542 | 8.7% |
| 7 | 134576 | 5.9% |
| 0 | 132983 | 5.8% |
| 5 | 87279 | 3.8% |
| 2 | 86813 | 3.8% |
| 4 | 76096 | 3.3% |
| 3 | 68554 | 3.0% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 45368 |
| Missing (%) | 7.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.192809347 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 8 |
| 3rd row | 5 |
| 4th row | 7 |
| 5th row | 11 |
| Value | Count | Frequency (%) |
| 7 | 63530 | |
| 8 | 55595 | |
| 6 | 55446 | |
| 3 | 50980 | |
| 5 | 50113 | |
| 4 | 46748 | |
| 9 | 43982 | |
| 2 | 43057 | |
| 10 | 40456 | |
| 1 | 39414 | |
| Other values (2) | 66762 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 181841 | |
| 2 | 74610 | |
| 7 | 63530 | 9.6% |
| 8 | 55595 | 8.4% |
| 6 | 55446 | 8.4% |
| 3 | 50980 | 7.7% |
| 5 | 50113 | 7.6% |
| 4 | 46748 | 7.0% |
| 9 | 43982 | 6.6% |
| 0 | 40456 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 663301 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 181841 | |
| 2 | 74610 | |
| 7 | 63530 | 9.6% |
| 8 | 55595 | 8.4% |
| 6 | 55446 | 8.4% |
| 3 | 50980 | 7.7% |
| 5 | 50113 | 7.6% |
| 4 | 46748 | 7.0% |
| 9 | 43982 | 6.6% |
| 0 | 40456 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 663301 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 181841 | |
| 2 | 74610 | |
| 7 | 63530 | 9.6% |
| 8 | 55595 | 8.4% |
| 6 | 55446 | 8.4% |
| 3 | 50980 | 7.7% |
| 5 | 50113 | 7.6% |
| 4 | 46748 | 7.0% |
| 9 | 43982 | 6.6% |
| 0 | 40456 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 663301 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 181841 | |
| 2 | 74610 | |
| 7 | 63530 | 9.6% |
| 8 | 55595 | 8.4% |
| 6 | 55446 | 8.4% |
| 3 | 50980 | 7.7% |
| 5 | 50113 | 7.6% |
| 4 | 46748 | 7.0% |
| 9 | 43982 | 6.6% |
| 0 | 40456 | 6.1% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 68254 |
| Missing (%) | 11.3% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.708122889 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 28 |
|---|---|
| 2nd row | 8 |
| 3rd row | 15 |
| 4th row | 5 |
| 5th row | 18 |
| Value | Count | Frequency (%) |
| 10 | 19183 | 3.6% |
| 20 | 18565 | 3.5% |
| 22 | 18462 | 3.5% |
| 15 | 18330 | 3.4% |
| 18 | 18186 | 3.4% |
| 14 | 17946 | 3.4% |
| 5 | 17919 | 3.4% |
| 16 | 17902 | 3.4% |
| 27 | 17827 | 3.3% |
| 21 | 17778 | 3.3% |
| Other values (21) | 351099 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 237918 | |
| 2 | 229380 | |
| 3 | 75570 | 8.3% |
| 5 | 53783 | 5.9% |
| 0 | 53189 | 5.8% |
| 8 | 53046 | 5.8% |
| 7 | 52750 | 5.8% |
| 6 | 52475 | 5.8% |
| 4 | 52022 | 5.7% |
| 9 | 50633 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 910766 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 237918 | |
| 2 | 229380 | |
| 3 | 75570 | 8.3% |
| 5 | 53783 | 5.9% |
| 0 | 53189 | 5.8% |
| 8 | 53046 | 5.8% |
| 7 | 52750 | 5.8% |
| 6 | 52475 | 5.8% |
| 4 | 52022 | 5.7% |
| 9 | 50633 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 910766 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 237918 | |
| 2 | 229380 | |
| 3 | 75570 | 8.3% |
| 5 | 53783 | 5.9% |
| 0 | 53189 | 5.8% |
| 8 | 53046 | 5.8% |
| 7 | 52750 | 5.8% |
| 6 | 52475 | 5.8% |
| 4 | 52022 | 5.7% |
| 9 | 50633 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 910766 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 237918 | |
| 2 | 229380 | |
| 3 | 75570 | 8.3% |
| 5 | 53783 | 5.9% |
| 0 | 53189 | 5.8% |
| 8 | 53046 | 5.8% |
| 7 | 52750 | 5.8% |
| 6 | 52475 | 5.8% |
| 4 | 52022 | 5.7% |
| 9 | 50633 | 5.6% |
Missing 
| Distinct | 45124 |
|---|---|
| Distinct (%) | 8.0% |
| Missing | 36490 |
| Missing (%) | 6.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 82 |
|---|---|
| Median length | 11 |
| Mean length | 10.73425953 |
| Min length | 3 |
Unique
| Unique | 7925 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | 28 Feb 1989 |
|---|---|
| 2nd row | 8 Aug 1917 |
| 3rd row | -- May 1966 |
| 4th row | 15 Jul 1894 |
| 5th row | 5 Nov 1992 |
| Value | Count | Frequency (%) |
| 119289 | 7.0% | |
| jul | 59029 | 3.5% |
| aug | 52663 | 3.1% |
| jun | 52253 | 3.1% |
| mar | 49098 | 2.9% |
| may | 47959 | 2.8% |
| apr | 45015 | 2.6% |
| sep | 41961 | 2.5% |
| feb | 40432 | 2.4% |
| oct | 39123 | 2.3% |
| Other values (873) | 1153619 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1135480 | ||
| 1 | 869039 | |
| 9 | 644744 | 10.6% |
| 2 | 290400 | 4.8% |
| - | 284559 | 4.7% |
| 6 | 256804 | 4.2% |
| 8 | 242113 | 4.0% |
| 7 | 176263 | 2.9% |
| u | 165038 | 2.7% |
| 0 | 163304 | 2.7% |
| Other values (65) | 1836694 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3034227 | |
| Space Separator | 1135480 | 18.7% |
| Lowercase Letter | 1072102 | 17.7% |
| Uppercase Letter | 534667 | 8.8% |
| Dash Punctuation | 284559 | 4.7% |
| Other Punctuation | 3387 | 0.1% |
| Close Punctuation | 7 | < 0.1% |
| Open Punctuation | 6 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 165038 | |
| a | 133875 | |
| e | 114602 | |
| r | 97161 | |
| n | 90730 | |
| p | 87414 | |
| c | 68763 | |
| l | 60684 | 5.7% |
| g | 53357 | 5.0% |
| y | 47929 | 4.5% |
| Other values (14) | 152549 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 147559 | |
| A | 97950 | |
| M | 97151 | |
| S | 43634 | 8.2% |
| F | 41188 | 7.7% |
| O | 39198 | 7.3% |
| N | 33829 | 6.3% |
| D | 30011 | 5.6% |
| W | 1456 | 0.3% |
| E | 615 | 0.1% |
| Other values (13) | 2076 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 869039 | |
| 9 | 644744 | |
| 2 | 290400 | 9.6% |
| 6 | 256804 | 8.5% |
| 8 | 242113 | 8.0% |
| 7 | 176263 | 5.8% |
| 0 | 163304 | 5.4% |
| 3 | 136893 | 4.5% |
| 5 | 134478 | 4.4% |
| 4 | 120189 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| * | 2267 | |
| , | 926 | |
| ? | 105 | 3.1% |
| : | 53 | 1.6% |
| / | 21 | 0.6% |
| . | 6 | 0.2% |
| ' | 5 | 0.1% |
| & | 2 | 0.1% |
| ; | 2 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 | |
| < | 1 | |
| ~ | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6 | |
| ] | 1 | 14.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 | |
| [ | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| 1135480 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 284559 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4457669 | |
| Latin | 1606769 | 26.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 165038 | 10.3% |
| J | 147559 | 9.2% |
| a | 133875 | 8.3% |
| e | 114602 | 7.1% |
| A | 97950 | 6.1% |
| r | 97161 | 6.0% |
| M | 97151 | 6.0% |
| n | 90730 | 5.6% |
| p | 87414 | 5.4% |
| c | 68763 | 4.3% |
| Other values (37) | 506526 |
Common
| Value | Count | Frequency (%) |
| 1135480 | ||
| 1 | 869039 | |
| 9 | 644744 | |
| 2 | 290400 | 6.5% |
| - | 284559 | 6.4% |
| 6 | 256804 | 5.8% |
| 8 | 242113 | 5.4% |
| 7 | 176263 | 4.0% |
| 0 | 163304 | 3.7% |
| 3 | 136893 | 3.1% |
| Other values (18) | 258070 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6064438 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1135480 | ||
| 1 | 869039 | |
| 9 | 644744 | 10.6% |
| 2 | 290400 | 4.8% |
| - | 284559 | 4.7% |
| 6 | 256804 | 4.2% |
| 8 | 242113 | 4.0% |
| 7 | 176263 | 2.9% |
| u | 165038 | 2.7% |
| 0 | 163304 | 2.7% |
| Other values (65) | 1836694 |
habitat
Text
Missing 
| Distinct | 7512 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 468915 |
| Missing (%) | 78.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 1014 |
|---|---|
| Median length | 694 |
| Mean length | 27.3692808 |
| Min length | 1 |
Unique
| Unique | 4415 ? |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | Ecological remarks by collector(s): yes |
|---|---|
| 2nd row | Premontane very humid forest |
| 3rd row | Ecological remarks by collector(s): no |
| 4th row | Ecological remarks by collector(s): yes |
| 5th row | Culvert |
| Value | Count | Frequency (%) |
| by | 49297 | 9.4% |
| ecological | 48727 | 9.3% |
| remarks | 48718 | 9.3% |
| collector(s | 48716 | 9.3% |
| yes | 41564 | 8.0% |
| forest | 32139 | 6.2% |
| tropical | 15058 | 2.9% |
| humid | 14768 | 2.8% |
| no | 7275 | 1.4% |
| in | 6943 | 1.3% |
| Other values (3497) | 208498 |
Most occurring characters
| Value | Count | Frequency (%) |
| 389167 | 10.7% | |
| o | 316538 | 8.7% |
| e | 293307 | 8.1% |
| r | 281112 | 7.7% |
| l | 253946 | 7.0% |
| s | 244547 | 6.7% |
| c | 240040 | 6.6% |
| a | 233816 | 6.4% |
| i | 137021 | 3.8% |
| t | 136017 | 3.7% |
| Other values (76) | 1101904 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2931962 | |
| Space Separator | 389167 | 10.7% |
| Uppercase Letter | 134371 | 3.7% |
| Other Punctuation | 62424 | 1.7% |
| Open Punctuation | 49723 | 1.4% |
| Close Punctuation | 49712 | 1.4% |
| Decimal Number | 6872 | 0.2% |
| Dash Punctuation | 3142 | 0.1% |
| Math Symbol | 40 | < 0.1% |
| Final Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 316538 | |
| e | 293307 | |
| r | 281112 | |
| l | 253946 | 8.7% |
| s | 244547 | 8.3% |
| c | 240040 | 8.2% |
| a | 233816 | 8.0% |
| i | 137021 | 4.7% |
| t | 136017 | 4.6% |
| y | 117063 | 4.0% |
| Other values (16) | 678555 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 49837 | |
| T | 18330 | 13.6% |
| S | 10045 | 7.5% |
| R | 7675 | 5.7% |
| P | 6589 | 4.9% |
| G | 6219 | 4.6% |
| C | 4362 | 3.2% |
| M | 4095 | 3.0% |
| A | 3747 | 2.8% |
| B | 3506 | 2.6% |
| Other values (16) | 19966 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 48943 | |
| , | 7291 | 11.7% |
| . | 4022 | 6.4% |
| ; | 832 | 1.3% |
| " | 403 | 0.6% |
| & | 381 | 0.6% |
| / | 229 | 0.4% |
| ? | 145 | 0.2% |
| ' | 102 | 0.2% |
| # | 62 | 0.1% |
| Other values (3) | 14 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2599 | |
| 1 | 1142 | |
| 2 | 872 | 12.7% |
| 3 | 636 | 9.3% |
| 5 | 469 | 6.8% |
| 4 | 334 | 4.9% |
| 8 | 251 | 3.7% |
| 6 | 220 | 3.2% |
| 7 | 185 | 2.7% |
| 9 | 164 | 2.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 49366 | |
| ] | 345 | 0.7% |
| } | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 33 | |
| + | 5 | 12.5% |
| ~ | 2 | 5.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 49378 | |
| [ | 345 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 389167 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3142 |
Final Punctuation
| Value | Count | Frequency (%) |
| › | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3066333 | |
| Common | 561082 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 316538 | 10.3% |
| e | 293307 | 9.6% |
| r | 281112 | 9.2% |
| l | 253946 | 8.3% |
| s | 244547 | 8.0% |
| c | 240040 | 7.8% |
| a | 233816 | 7.6% |
| i | 137021 | 4.5% |
| t | 136017 | 4.4% |
| y | 117063 | 3.8% |
| Other values (42) | 812926 |
Common
| Value | Count | Frequency (%) |
| 389167 | ||
| ( | 49378 | 8.8% |
| ) | 49366 | 8.8% |
| : | 48943 | 8.7% |
| , | 7291 | 1.3% |
| . | 4022 | 0.7% |
| - | 3142 | 0.6% |
| 0 | 2599 | 0.5% |
| 1 | 1142 | 0.2% |
| 2 | 872 | 0.2% |
| Other values (24) | 5160 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3627413 | |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 389167 | 10.7% | |
| o | 316538 | 8.7% |
| e | 293307 | 8.1% |
| r | 281112 | 7.7% |
| l | 253946 | 7.0% |
| s | 244547 | 6.7% |
| c | 240040 | 6.6% |
| a | 233816 | 6.4% |
| i | 137021 | 3.8% |
| t | 136017 | 3.7% |
| Other values (75) | 1101902 |
Punctuation
| Value | Count | Frequency (%) |
| › | 2 |
higherGeography
Text
| Distinct | 8925 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 440 |
| Missing (%) | 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 146 |
|---|---|
| Median length | 124 |
| Mean length | 39.09340095 |
| Min length | 4 |
Unique
| Unique | 3023 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | North America, Panama, Bocas Del Toro |
|---|---|
| 2nd row | North America, United States, Utah |
| 3rd row | South America, Venezuela, Bolivar |
| 4th row | North America, Mexico, Oaxaca |
| 5th row | North America, North Atlantic Ocean, United States, North Carolina, Carteret |
| Value | Count | Frequency (%) |
| america | 390243 | 12.4% |
| north | 378352 | 12.1% |
| united | 229925 | 7.3% |
| states | 225212 | 7.2% |
| africa | 111667 | 3.6% |
| south | 90792 | 2.9% |
| county | 80759 | 2.6% |
| asia | 66157 | 2.1% |
| ocean | 58408 | 1.9% |
| mexico | 50692 | 1.6% |
| Other values (5566) | 1452640 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2533836 | 10.8% | |
| a | 2342309 | 10.0% |
| i | 1683292 | 7.2% |
| t | 1628350 | 6.9% |
| e | 1586909 | 6.8% |
| r | 1444280 | 6.1% |
| , | 1372561 | 5.8% |
| o | 1263879 | 5.4% |
| n | 1236327 | 5.3% |
| c | 879180 | 3.7% |
| Other values (81) | 7524641 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16409922 | |
| Uppercase Letter | 3147373 | 13.4% |
| Space Separator | 2533836 | 10.8% |
| Other Punctuation | 1384733 | 5.9% |
| Dash Punctuation | 19470 | 0.1% |
| Open Punctuation | 106 | < 0.1% |
| Close Punctuation | 106 | < 0.1% |
| Decimal Number | 12 | < 0.1% |
| Modifier Letter | 5 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2342309 | |
| i | 1683292 | |
| t | 1628350 | |
| e | 1586909 | |
| r | 1444280 | |
| o | 1263879 | |
| n | 1236327 | |
| c | 879180 | 5.4% |
| s | 644321 | 3.9% |
| h | 637727 | 3.9% |
| Other values (35) | 3063348 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 694612 | |
| N | 456008 | |
| S | 407690 | |
| U | 266626 | 8.5% |
| C | 259605 | 8.2% |
| M | 141875 | 4.5% |
| P | 124642 | 4.0% |
| O | 99864 | 3.2% |
| B | 97350 | 3.1% |
| T | 70558 | 2.2% |
| Other values (17) | 528543 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1372561 | |
| ' | 7365 | 0.5% |
| . | 3951 | 0.3% |
| ? | 630 | < 0.1% |
| * | 122 | < 0.1% |
| / | 103 | < 0.1% |
| : | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 4 | |
| 2 | 4 | |
| 1 | 2 | |
| 0 | 1 | 8.3% |
| 8 | 1 | 8.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19466 | |
| – | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2533836 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 106 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 106 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 5 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19557295 | |
| Common | 3938269 | 16.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2342309 | 12.0% |
| i | 1683292 | 8.6% |
| t | 1628350 | 8.3% |
| e | 1586909 | 8.1% |
| r | 1444280 | 7.4% |
| o | 1263879 | 6.5% |
| n | 1236327 | 6.3% |
| c | 879180 | 4.5% |
| A | 694612 | 3.6% |
| s | 644321 | 3.3% |
| Other values (62) | 6153836 |
Common
| Value | Count | Frequency (%) |
| 2533836 | ||
| , | 1372561 | |
| - | 19466 | 0.5% |
| ' | 7365 | 0.2% |
| . | 3951 | 0.1% |
| ? | 630 | < 0.1% |
| * | 122 | < 0.1% |
| ( | 106 | < 0.1% |
| ) | 106 | < 0.1% |
| / | 103 | < 0.1% |
| Other values (9) | 23 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23494048 | |
| None | 1504 | < 0.1% |
| Modifier Letters | 5 | < 0.1% |
| Punctuation | 4 | < 0.1% |
| Latin Ext Additional | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2533836 | 10.8% | |
| a | 2342309 | 10.0% |
| i | 1683292 | 7.2% |
| t | 1628350 | 6.9% |
| e | 1586909 | 6.8% |
| r | 1444280 | 6.1% |
| , | 1372561 | 5.8% |
| o | 1263879 | 5.4% |
| n | 1236327 | 5.3% |
| c | 879180 | 3.7% |
| Other values (59) | 7523125 |
None
| Value | Count | Frequency (%) |
| é | 564 | |
| ó | 346 | |
| ä | 178 | 11.8% |
| í | 176 | 11.7% |
| ê | 104 | 6.9% |
| è | 57 | 3.8% |
| ô | 53 | 3.5% |
| ū | 5 | 0.3% |
| ā | 4 | 0.3% |
| Đ | 3 | 0.2% |
| Other values (9) | 14 | 0.9% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 5 |
Punctuation
| Value | Count | Frequency (%) |
| – | 4 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ắ | 3 |
continent
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 39181 |
| Missing (%) | 6.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 10.4674249 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | SOUTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 305548 | |
| africa | 100847 | 17.9% |
| south_america | 70554 | 12.5% |
| asia | 64472 | 11.5% |
| europe | 13203 | 2.3% |
| oceania | 7485 | 1.3% |
| antarctica | 161 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1098295 | |
| R | 795861 | |
| I | 549067 | |
| C | 484756 | |
| E | 409993 | 7.0% |
| O | 396790 | 6.7% |
| T | 376424 | 6.4% |
| H | 376102 | 6.4% |
| _ | 376102 | 6.4% |
| M | 376102 | 6.4% |
| Other values (5) | 646027 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5509417 | |
| Connector Punctuation | 376102 | 6.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1098295 | |
| R | 795861 | |
| I | 549067 | |
| C | 484756 | |
| E | 409993 | 7.4% |
| O | 396790 | 7.2% |
| T | 376424 | 6.8% |
| H | 376102 | 6.8% |
| M | 376102 | 6.8% |
| N | 313194 | 5.7% |
| Other values (4) | 332833 | 6.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 376102 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5509417 | |
| Common | 376102 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1098295 | |
| R | 795861 | |
| I | 549067 | |
| C | 484756 | |
| E | 409993 | 7.4% |
| O | 396790 | 7.2% |
| T | 376424 | 6.8% |
| H | 376102 | 6.8% |
| M | 376102 | 6.8% |
| N | 313194 | 5.7% |
| Other values (4) | 332833 | 6.0% |
Common
| Value | Count | Frequency (%) |
| _ | 376102 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5885519 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1098295 | |
| R | 795861 | |
| I | 549067 | |
| C | 484756 | |
| E | 409993 | 7.0% |
| O | 396790 | 6.7% |
| T | 376424 | 6.4% |
| H | 376102 | 6.4% |
| _ | 376102 | 6.4% |
| M | 376102 | 6.4% |
| Other values (5) | 646027 |
waterBody
Text
Missing 
| Distinct | 1298 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 539858 |
| Missing (%) | 89.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 79 |
|---|---|
| Median length | 75 |
| Mean length | 24.02534379 |
| Min length | 6 |
Unique
| Unique | 776 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | North Atlantic Ocean |
|---|---|
| 2nd row | North Pacific Ocean, Bering Sea |
| 3rd row | North Pacific Ocean |
| 4th row | North Atlantic Ocean, Gulf Of Mexico |
| 5th row | North Pacific Ocean |
| Value | Count | Frequency (%) |
| ocean | 58130 | |
| north | 49957 | |
| atlantic | 30063 | |
| pacific | 21536 | 9.4% |
| sea | 8710 | 3.8% |
| of | 8285 | 3.6% |
| gulf | 7277 | 3.2% |
| mexico | 6087 | 2.7% |
| south | 3736 | 1.6% |
| indian | 3443 | 1.5% |
| Other values (1047) | 32100 |
Most occurring characters
| Value | Count | Frequency (%) |
| 167731 | ||
| a | 149650 | 10.1% |
| c | 142458 | 9.6% |
| t | 125319 | 8.5% |
| n | 116971 | 7.9% |
| i | 97425 | 6.6% |
| e | 90274 | 6.1% |
| o | 70318 | 4.8% |
| O | 66128 | 4.5% |
| r | 64946 | 4.4% |
| Other values (51) | 388573 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1060630 | |
| Uppercase Letter | 228943 | 15.5% |
| Space Separator | 167731 | 11.3% |
| Other Punctuation | 22340 | 1.5% |
| Dash Punctuation | 147 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 149650 | |
| c | 142458 | |
| t | 125319 | |
| n | 116971 | |
| i | 97425 | |
| e | 90274 | |
| o | 70318 | |
| r | 64946 | |
| h | 61407 | |
| l | 46029 | 4.3% |
| Other values (17) | 95833 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 66128 | |
| N | 50247 | |
| A | 32498 | |
| P | 22062 | 9.6% |
| S | 16927 | 7.4% |
| G | 7662 | 3.3% |
| C | 7479 | 3.3% |
| M | 7332 | 3.2% |
| B | 7248 | 3.2% |
| I | 3893 | 1.7% |
| Other values (15) | 7467 | 3.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 22196 | |
| ? | 67 | 0.3% |
| . | 43 | 0.2% |
| ' | 33 | 0.1% |
| * | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 167731 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 147 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1289573 | |
| Common | 190220 | 12.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 149650 | |
| c | 142458 | |
| t | 125319 | |
| n | 116971 | 9.1% |
| i | 97425 | 7.6% |
| e | 90274 | 7.0% |
| o | 70318 | 5.5% |
| O | 66128 | 5.1% |
| r | 64946 | 5.0% |
| h | 61407 | 4.8% |
| Other values (42) | 304677 |
Common
| Value | Count | Frequency (%) |
| 167731 | ||
| , | 22196 | 11.7% |
| - | 147 | 0.1% |
| ? | 67 | < 0.1% |
| . | 43 | < 0.1% |
| ' | 33 | < 0.1% |
| * | 1 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1479792 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 167731 | ||
| a | 149650 | 10.1% |
| c | 142458 | 9.6% |
| t | 125319 | 8.5% |
| n | 116971 | 7.9% |
| i | 97425 | 6.6% |
| e | 90274 | 6.1% |
| o | 70318 | 4.8% |
| O | 66128 | 4.5% |
| r | 64946 | 4.4% |
| Other values (50) | 388572 |
None
| Value | Count | Frequency (%) |
| ö | 1 |
islandGroup
Text
Missing 
| Distinct | 68 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 596682 |
| Missing (%) | 99.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 24 |
| Mean length | 13.28538478 |
| Min length | 8 |
Unique
| Unique | 18 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Pribilof Islands |
|---|---|
| 2nd row | Pribilof Islands |
| 3rd row | Ryukyu Islands |
| 4th row | Pribilof Islands |
| 5th row | Batan Islands |
| Value | Count | Frequency (%) |
| islands | 3374 | |
| pribilof | 1808 | |
| moluccas | 1194 | 14.4% |
| ryukyu | 497 | 6.0% |
| babuyan | 176 | 2.1% |
| channel | 159 | 1.9% |
| batan | 120 | 1.5% |
| nicobar | 108 | 1.3% |
| bismarck | 94 | 1.1% |
| yap | 83 | 1.0% |
| Other values (66) | 653 | 7.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 8103 | |
| l | 6718 | 10.6% |
| a | 6381 | 10.1% |
| n | 4444 | 7.0% |
| i | 4222 | 6.7% |
| d | 3521 | 5.6% |
| 3497 | 5.5% | |
| I | 3376 | 5.3% |
| o | 3353 | 5.3% |
| c | 2688 | 4.2% |
| Other values (36) | 17055 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 51599 | |
| Uppercase Letter | 8262 | 13.0% |
| Space Separator | 3497 | 5.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 8103 | |
| l | 6718 | |
| a | 6381 | |
| n | 4444 | |
| i | 4222 | |
| d | 3521 | |
| o | 3353 | |
| c | 2688 | 5.2% |
| u | 2566 | 5.0% |
| r | 2242 | 4.3% |
| Other values (14) | 7361 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 3376 | |
| P | 1814 | |
| M | 1235 | 14.9% |
| R | 497 | 6.0% |
| B | 412 | 5.0% |
| C | 183 | 2.2% |
| S | 153 | 1.9% |
| A | 151 | 1.8% |
| N | 122 | 1.5% |
| Y | 83 | 1.0% |
| Other values (11) | 236 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| 3497 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59861 | |
| Common | 3497 | 5.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 8103 | |
| l | 6718 | |
| a | 6381 | |
| n | 4444 | 7.4% |
| i | 4222 | 7.1% |
| d | 3521 | 5.9% |
| I | 3376 | 5.6% |
| o | 3353 | 5.6% |
| c | 2688 | 4.5% |
| u | 2566 | 4.3% |
| Other values (35) | 14489 |
Common
| Value | Count | Frequency (%) |
| 3497 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 63358 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 8103 | |
| l | 6718 | 10.6% |
| a | 6381 | 10.1% |
| n | 4444 | 7.0% |
| i | 4222 | 6.7% |
| d | 3521 | 5.6% |
| 3497 | 5.5% | |
| I | 3376 | 5.3% |
| o | 3353 | 5.3% |
| c | 2688 | 4.2% |
| Other values (36) | 17055 |
island
Text
Missing 
| Distinct | 345 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 564842 |
| Missing (%) | 93.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 21 |
| Mean length | 8.146903767 |
| Min length | 1 |
Unique
| Unique | 103 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | St. Paul Island |
|---|---|
| 2nd row | St. Paul Island |
| 3rd row | Trinidad |
| 4th row | Borneo |
| 5th row | Culion Island |
| Value | Count | Frequency (%) |
| island | 7184 | |
| borneo | 5932 | 12.2% |
| sumatra | 3675 | 7.5% |
| luzon | 3124 | 6.4% |
| java | 3005 | 6.2% |
| celebes | 2678 | 5.5% |
| trinidad | 2605 | 5.4% |
| st | 1818 | 3.7% |
| paul | 1799 | 3.7% |
| honshu | 1290 | 2.6% |
| Other values (366) | 15576 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 39564 | |
| n | 28846 | 9.7% |
| o | 23778 | 8.0% |
| e | 21049 | 7.1% |
| r | 16512 | 5.5% |
| d | 15796 | 5.3% |
| l | 15656 | 5.2% |
| s | 14538 | 4.9% |
| u | 14063 | 4.7% |
| 12077 | 4.0% | |
| Other values (47) | 96371 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 235808 | |
| Uppercase Letter | 48529 | 16.3% |
| Space Separator | 12077 | 4.0% |
| Other Punctuation | 1830 | 0.6% |
| Dash Punctuation | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 39564 | |
| n | 28846 | |
| o | 23778 | |
| e | 21049 | |
| r | 16512 | |
| d | 15796 | 6.7% |
| l | 15656 | 6.6% |
| s | 14538 | 6.2% |
| u | 14063 | 6.0% |
| i | 11254 | 4.8% |
| Other values (16) | 34752 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 7839 | |
| B | 7137 | |
| S | 7123 | |
| L | 4203 | |
| C | 3825 | |
| P | 3689 | |
| T | 3258 | |
| J | 3022 | 6.2% |
| N | 2160 | 4.5% |
| H | 1664 | 3.4% |
| Other values (14) | 4609 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1817 | |
| ' | 9 | 0.5% |
| ? | 2 | 0.1% |
| * | 1 | 0.1% |
| , | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 12077 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 284337 | |
| Common | 13913 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 39564 | |
| n | 28846 | 10.1% |
| o | 23778 | 8.4% |
| e | 21049 | 7.4% |
| r | 16512 | 5.8% |
| d | 15796 | 5.6% |
| l | 15656 | 5.5% |
| s | 14538 | 5.1% |
| u | 14063 | 4.9% |
| i | 11254 | 4.0% |
| Other values (40) | 83281 |
Common
| Value | Count | Frequency (%) |
| 12077 | ||
| . | 1817 | 13.1% |
| ' | 9 | 0.1% |
| - | 6 | < 0.1% |
| ? | 2 | < 0.1% |
| * | 1 | < 0.1% |
| , | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 298250 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 39564 | |
| n | 28846 | 9.7% |
| o | 23778 | 8.0% |
| e | 21049 | 7.1% |
| r | 16512 | 5.5% |
| d | 15796 | 5.3% |
| l | 15656 | 5.2% |
| s | 14538 | 4.9% |
| u | 14063 | 4.7% |
| 12077 | 4.0% | |
| Other values (47) | 96371 |
countryCode
Text
| Distinct | 221 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4662 |
| Missing (%) | 0.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PA |
|---|---|
| 2nd row | US |
| 3rd row | VE |
| 4th row | MX |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 226290 | |
| mx | 35569 | 6.0% |
| pa | 25486 | 4.3% |
| ve | 24981 | 4.2% |
| ca | 19304 | 3.2% |
| co | 16625 | 2.8% |
| id | 14924 | 2.5% |
| zz | 13450 | 2.3% |
| br | 12246 | 2.1% |
| za | 11853 | 2.0% |
| Other values (211) | 196061 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 237899 | |
| U | 236282 | |
| A | 75072 | 6.3% |
| M | 63999 | 5.4% |
| C | 56100 | 4.7% |
| E | 50124 | 4.2% |
| P | 48406 | 4.1% |
| Z | 47388 | 4.0% |
| G | 36679 | 3.1% |
| X | 35577 | 3.0% |
| Other values (16) | 306052 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1193578 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 237899 | |
| U | 236282 | |
| A | 75072 | 6.3% |
| M | 63999 | 5.4% |
| C | 56100 | 4.7% |
| E | 50124 | 4.2% |
| P | 48406 | 4.1% |
| Z | 47388 | 4.0% |
| G | 36679 | 3.1% |
| X | 35577 | 3.0% |
| Other values (16) | 306052 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1193578 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 237899 | |
| U | 236282 | |
| A | 75072 | 6.3% |
| M | 63999 | 5.4% |
| C | 56100 | 4.7% |
| E | 50124 | 4.2% |
| P | 48406 | 4.1% |
| Z | 47388 | 4.0% |
| G | 36679 | 3.1% |
| X | 35577 | 3.0% |
| Other values (16) | 306052 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1193578 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 237899 | |
| U | 236282 | |
| A | 75072 | 6.3% |
| M | 63999 | 5.4% |
| C | 56100 | 4.7% |
| E | 50124 | 4.2% |
| P | 48406 | 4.1% |
| Z | 47388 | 4.0% |
| G | 36679 | 3.1% |
| X | 35577 | 3.0% |
| Other values (16) | 306052 |
stateProvince
Text
Missing 
| Distinct | 1750 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 93954 |
| Missing (%) | 15.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 27 |
| Mean length | 9.156487625 |
| Min length | 1 |
Unique
| Unique | 314 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Bocas Del Toro |
|---|---|
| 2nd row | Utah |
| 3rd row | Bolivar |
| 4th row | Oaxaca |
| 5th row | North Carolina |
| Value | Count | Frequency (%) |
| california | 37958 | 5.7% |
| new | 18698 | 2.8% |
| alaska | 18000 | 2.7% |
| oregon | 15112 | 2.3% |
| province | 15077 | 2.2% |
| arizona | 13072 | 1.9% |
| virginia | 12189 | 1.8% |
| washington | 12057 | 1.8% |
| texas | 11524 | 1.7% |
| mexico | 9875 | 1.5% |
| Other values (1720) | 507096 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 685721 | |
| i | 388351 | 8.4% |
| n | 356516 | 7.7% |
| o | 350614 | 7.5% |
| r | 326855 | 7.0% |
| e | 277944 | 6.0% |
| l | 192295 | 4.1% |
| s | 173201 | 3.7% |
| t | 172374 | 3.7% |
| 163161 | 3.5% | |
| Other values (65) | 1559858 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3782086 | |
| Uppercase Letter | 683335 | 14.7% |
| Space Separator | 163161 | 3.5% |
| Dash Punctuation | 15111 | 0.3% |
| Other Punctuation | 3190 | 0.1% |
| Decimal Number | 4 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 685721 | |
| i | 388351 | |
| n | 356516 | |
| o | 350614 | |
| r | 326855 | |
| e | 277944 | 7.3% |
| l | 192295 | 5.1% |
| s | 173201 | 4.6% |
| t | 172374 | 4.6% |
| u | 116650 | 3.1% |
| Other values (25) | 741565 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 96322 | |
| A | 66126 | 9.7% |
| N | 63963 | 9.4% |
| M | 54370 | 8.0% |
| S | 44892 | 6.6% |
| T | 39318 | 5.8% |
| P | 37886 | 5.5% |
| B | 35544 | 5.2% |
| W | 30828 | 4.5% |
| O | 27556 | 4.0% |
| Other values (16) | 186530 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 2998 | |
| ? | 159 | 5.0% |
| / | 21 | 0.7% |
| * | 6 | 0.2% |
| . | 5 | 0.2% |
| : | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 1 | |
| 8 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 163161 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15111 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4465421 | |
| Common | 181469 | 3.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 685721 | |
| i | 388351 | 8.7% |
| n | 356516 | 8.0% |
| o | 350614 | 7.9% |
| r | 326855 | 7.3% |
| e | 277944 | 6.2% |
| l | 192295 | 4.3% |
| s | 173201 | 3.9% |
| t | 172374 | 3.9% |
| u | 116650 | 2.6% |
| Other values (51) | 1424900 |
Common
| Value | Count | Frequency (%) |
| 163161 | ||
| - | 15111 | 8.3% |
| ' | 2998 | 1.7% |
| ? | 159 | 0.1% |
| / | 21 | < 0.1% |
| * | 6 | < 0.1% |
| . | 5 | < 0.1% |
| 1 | 2 | < 0.1% |
| 0 | 1 | < 0.1% |
| : | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4645873 | |
| None | 1017 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 685721 | |
| i | 388351 | 8.4% |
| n | 356516 | 7.7% |
| o | 350614 | 7.5% |
| r | 326855 | 7.0% |
| e | 277944 | 6.0% |
| l | 192295 | 4.1% |
| s | 173201 | 3.7% |
| t | 172374 | 3.7% |
| 163161 | 3.5% | |
| Other values (56) | 1558841 |
None
| Value | Count | Frequency (%) |
| é | 367 | |
| ó | 346 | |
| ä | 178 | |
| ê | 92 | 9.0% |
| ô | 30 | 2.9% |
| ç | 1 | 0.1% |
| ã | 1 | 0.1% |
| ō | 1 | 0.1% |
| æ | 1 | 0.1% |
county
Text
Missing 
| Distinct | 3194 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 447402 |
| Missing (%) | 74.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 47 |
|---|---|
| Median length | 27 |
| Mean length | 13.46725393 |
| Min length | 1 |
Unique
| Unique | 663 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Carteret |
|---|---|
| 2nd row | Cusco |
| 3rd row | Monterey County |
| 4th row | Galveston |
| 5th row | Tamana Ward |
| Value | Count | Frequency (%) |
| county | 80697 | |
| district | 13828 | 4.7% |
| islands | 3705 | 1.3% |
| division | 3460 | 1.2% |
| san | 3315 | 1.1% |
| province | 2619 | 0.9% |
| schoolcraft | 2179 | 0.7% |
| mackenzie | 1966 | 0.7% |
| lane | 1935 | 0.7% |
| municipality | 1862 | 0.6% |
| Other values (2969) | 178313 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 189818 | 9.1% |
| o | 175404 | 8.5% |
| t | 161467 | 7.8% |
| a | 160330 | 7.7% |
| 139830 | 6.7% | |
| i | 120188 | 5.8% |
| u | 116014 | 5.6% |
| e | 111686 | 5.4% |
| r | 102364 | 4.9% |
| C | 99007 | 4.8% |
| Other values (69) | 698509 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1630734 | |
| Uppercase Letter | 298270 | 14.4% |
| Space Separator | 139830 | 6.7% |
| Dash Punctuation | 4189 | 0.2% |
| Other Punctuation | 1555 | 0.1% |
| Close Punctuation | 13 | < 0.1% |
| Open Punctuation | 13 | < 0.1% |
| Decimal Number | 8 | < 0.1% |
| Modifier Letter | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 189818 | |
| o | 175404 | |
| t | 161467 | |
| a | 160330 | |
| i | 120188 | 7.4% |
| u | 116014 | 7.1% |
| e | 111686 | 6.8% |
| r | 102364 | 6.3% |
| y | 97639 | 6.0% |
| s | 76836 | 4.7% |
| Other values (28) | 318988 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 99007 | |
| D | 27665 | 9.3% |
| S | 18077 | 6.1% |
| M | 17795 | 6.0% |
| B | 15214 | 5.1% |
| P | 13875 | 4.7% |
| A | 12422 | 4.2% |
| L | 11112 | 3.7% |
| G | 10792 | 3.6% |
| W | 8980 | 3.0% |
| Other values (17) | 63331 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1171 | |
| . | 192 | 12.3% |
| * | 113 | 7.3% |
| ? | 56 | 3.6% |
| / | 21 | 1.4% |
| , | 2 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4185 | |
| – | 4 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 4 | 4 |
Space Separator
| Value | Count | Frequency (%) |
| 139830 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 13 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1929004 | |
| Common | 145613 | 7.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 189818 | 9.8% |
| o | 175404 | 9.1% |
| t | 161467 | 8.4% |
| a | 160330 | 8.3% |
| i | 120188 | 6.2% |
| u | 116014 | 6.0% |
| e | 111686 | 5.8% |
| r | 102364 | 5.3% |
| C | 99007 | 5.1% |
| y | 97639 | 5.1% |
| Other values (55) | 595087 |
Common
| Value | Count | Frequency (%) |
| 139830 | ||
| - | 4185 | 2.9% |
| ' | 1171 | 0.8% |
| . | 192 | 0.1% |
| * | 113 | 0.1% |
| ? | 56 | < 0.1% |
| / | 21 | < 0.1% |
| ) | 13 | < 0.1% |
| ( | 13 | < 0.1% |
| ʻ | 5 | < 0.1% |
| Other values (4) | 14 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2074120 | |
| None | 485 | < 0.1% |
| Modifier Letters | 5 | < 0.1% |
| Punctuation | 4 | < 0.1% |
| Latin Ext Additional | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 189818 | 9.2% |
| o | 175404 | 8.5% |
| t | 161467 | 7.8% |
| a | 160330 | 7.7% |
| 139830 | 6.7% | |
| i | 120188 | 5.8% |
| u | 116014 | 5.6% |
| e | 111686 | 5.4% |
| r | 102364 | 4.9% |
| C | 99007 | 4.8% |
| Other values (54) | 698012 |
None
| Value | Count | Frequency (%) |
| é | 197 | |
| í | 176 | |
| è | 57 | 11.8% |
| ô | 23 | 4.7% |
| ê | 12 | 2.5% |
| ū | 5 | 1.0% |
| ā | 4 | 0.8% |
| Đ | 3 | 0.6% |
| ơ | 3 | 0.6% |
| à | 3 | 0.6% |
| Other values (2) | 2 | 0.4% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 5 |
Punctuation
| Value | Count | Frequency (%) |
| – | 4 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ắ | 3 |
locality
Text
Missing 
| Distinct | 86656 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 35404 |
| Missing (%) | 5.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 294 |
|---|---|
| Median length | 159 |
| Mean length | 21.69044267 |
| Min length | 1 |
Unique
| Unique | 52764 ? |
|---|---|
| Unique (%) | 9.3% |
Sample
| 1st row | Tierra Oscura, 3.5 Km S. Tiger Key |
|---|---|
| 2nd row | Uinta Forest, Currant Creek |
| 3rd row | km. 125, 85 Km SSE El Dorado |
| 4th row | Totontepec |
| 5th row | Atlantic Beach, Atlantic Beach, 1/2 Mi E Of Triple S Pier. |
| Value | Count | Frequency (%) |
| km | 82857 | 3.9% |
| mi | 82389 | 3.8% |
| of | 34259 | 1.6% |
| n | 30440 | 1.4% |
| river | 28140 | 1.3% |
| s | 27057 | 1.3% |
| e | 26413 | 1.2% |
| w | 26172 | 1.2% |
| island | 23296 | 1.1% |
| san | 23251 | 1.1% |
| Other values (42744) | 1760837 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1579064 | 12.9% | |
| a | 1198874 | 9.8% |
| e | 766623 | 6.2% |
| i | 659790 | 5.4% |
| n | 655819 | 5.3% |
| o | 653029 | 5.3% |
| r | 550116 | 4.5% |
| l | 446951 | 3.6% |
| t | 434393 | 3.5% |
| , | 393002 | 3.2% |
| Other values (116) | 4940149 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7761837 | |
| Uppercase Letter | 2026832 | 16.5% |
| Space Separator | 1579064 | 12.9% |
| Other Punctuation | 489421 | 4.0% |
| Decimal Number | 361074 | 2.9% |
| Open Punctuation | 19801 | 0.2% |
| Close Punctuation | 19779 | 0.2% |
| Dash Punctuation | 15950 | 0.1% |
| Math Symbol | 3991 | < 0.1% |
| Connector Punctuation | 54 | < 0.1% |
| Other values (3) | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1198874 | |
| e | 766623 | |
| i | 659790 | 8.5% |
| n | 655819 | 8.4% |
| o | 653029 | 8.4% |
| r | 550116 | 7.1% |
| l | 446951 | 5.8% |
| t | 434393 | 5.6% |
| s | 353920 | 4.6% |
| u | 324066 | 4.2% |
| Other values (49) | 1718256 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 227459 | 11.2% |
| M | 200123 | 9.9% |
| C | 146981 | 7.3% |
| N | 141674 | 7.0% |
| K | 124320 | 6.1% |
| R | 117187 | 5.8% |
| B | 112739 | 5.6% |
| P | 108902 | 5.4% |
| E | 107630 | 5.3% |
| W | 98674 | 4.9% |
| Other values (21) | 641143 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 393002 | |
| . | 71840 | 14.7% |
| ; | 9568 | 2.0% |
| ' | 6996 | 1.4% |
| / | 2669 | 0.5% |
| : | 2390 | 0.5% |
| " | 1272 | 0.3% |
| ? | 612 | 0.1% |
| & | 491 | 0.1% |
| # | 388 | 0.1% |
| Other values (3) | 193 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 75042 | |
| 2 | 56929 | |
| 5 | 50938 | |
| 0 | 37299 | |
| 3 | 35827 | |
| 4 | 29576 | 8.2% |
| 6 | 25291 | 7.0% |
| 8 | 19038 | 5.3% |
| 7 | 17890 | 5.0% |
| 9 | 13244 | 3.7% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 3747 | |
| + | 184 | 4.6% |
| ~ | 60 | 1.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10013 | |
| [ | 9788 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 9993 | |
| ] | 9786 |
Space Separator
| Value | Count | Frequency (%) |
| 1579064 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15950 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 54 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3 |
Final Punctuation
| Value | Count | Frequency (%) |
| › | 2 |
Other Number
| Value | Count | Frequency (%) |
| ¼ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9788654 | |
| Common | 2489141 | 20.3% |
| Cyrillic | 15 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1198874 | 12.2% |
| e | 766623 | 7.8% |
| i | 659790 | 6.7% |
| n | 655819 | 6.7% |
| o | 653029 | 6.7% |
| r | 550116 | 5.6% |
| l | 446951 | 4.6% |
| t | 434393 | 4.4% |
| s | 353920 | 3.6% |
| u | 324066 | 3.3% |
| Other values (68) | 3745073 |
Common
| Value | Count | Frequency (%) |
| 1579064 | ||
| , | 393002 | 15.8% |
| 1 | 75042 | 3.0% |
| . | 71840 | 2.9% |
| 2 | 56929 | 2.3% |
| 5 | 50938 | 2.0% |
| 0 | 37299 | 1.5% |
| 3 | 35827 | 1.4% |
| 4 | 29576 | 1.2% |
| 6 | 25291 | 1.0% |
| Other values (26) | 134333 | 5.4% |
Cyrillic
| Value | Count | Frequency (%) |
| л | 3 | |
| к | 2 | |
| т | 1 | 6.7% |
| і | 1 | 6.7% |
| ө | 1 | 6.7% |
| ы | 1 | 6.7% |
| а | 1 | 6.7% |
| м | 1 | 6.7% |
| н | 1 | 6.7% |
| е | 1 | 6.7% |
| Other values (2) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12277174 | |
| None | 619 | < 0.1% |
| Cyrillic | 15 | < 0.1% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1579064 | 12.9% | |
| a | 1198874 | 9.8% |
| e | 766623 | 6.2% |
| i | 659790 | 5.4% |
| n | 655819 | 5.3% |
| o | 653029 | 5.3% |
| r | 550116 | 4.5% |
| l | 446951 | 3.6% |
| t | 434393 | 3.5% |
| , | 393002 | 3.2% |
| Other values (75) | 4939513 |
None
| Value | Count | Frequency (%) |
| é | 382 | |
| è | 107 | 17.3% |
| ø | 19 | 3.1% |
| ñ | 19 | 3.1% |
| á | 11 | 1.8% |
| ö | 11 | 1.8% |
| ã | 7 | 1.1% |
| ü | 7 | 1.1% |
| ó | 7 | 1.1% |
| Œ | 6 | 1.0% |
| Other values (18) | 43 | 6.9% |
Cyrillic
| Value | Count | Frequency (%) |
| л | 3 | |
| к | 2 | |
| т | 1 | 6.7% |
| і | 1 | 6.7% |
| ө | 1 | 6.7% |
| ы | 1 | 6.7% |
| а | 1 | 6.7% |
| м | 1 | 6.7% |
| н | 1 | 6.7% |
| е | 1 | 6.7% |
| Other values (2) | 2 |
Punctuation
| Value | Count | Frequency (%) |
| › | 2 |
Missing 
| Distinct | 29 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 599861 |
| Missing (%) | 99.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 8 |
| Mean length | 8.518867925 |
| Min length | 2 |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | sea level |
|---|---|
| 2nd row | sealevel |
| 3rd row | sealevel |
| 4th row | sealevel |
| 5th row | see Osgood 1909:214 |
| Value | Count | Frequency (%) |
| sealevel | 1096 | |
| sea | 280 | 12.0% |
| level | 277 | 11.9% |
| ft | 143 | 6.1% |
| 104 | 4.5% | |
| 100 | 81 | 3.5% |
| m | 59 | 2.5% |
| near | 32 | 1.4% |
| below | 30 | 1.3% |
| 3 | 28 | 1.2% |
| Other values (33) | 206 | 8.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4198 | |
| l | 2792 | |
| a | 1481 | 10.9% |
| s | 1380 | 10.2% |
| v | 1376 | 10.2% |
| 746 | 5.5% | |
| 0 | 314 | 2.3% |
| t | 156 | 1.2% |
| 1 | 152 | 1.1% |
| f | 143 | 1.1% |
| Other values (33) | 807 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12018 | |
| Space Separator | 746 | 5.5% |
| Decimal Number | 555 | 4.1% |
| Math Symbol | 110 | 0.8% |
| Uppercase Letter | 87 | 0.6% |
| Dash Punctuation | 22 | 0.2% |
| Other Punctuation | 5 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4198 | |
| l | 2792 | |
| a | 1481 | 12.3% |
| s | 1380 | 11.5% |
| v | 1376 | 11.4% |
| t | 156 | 1.3% |
| f | 143 | 1.2% |
| c | 92 | 0.8% |
| m | 62 | 0.5% |
| r | 61 | 0.5% |
| Other values (12) | 277 | 2.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 314 | |
| 1 | 152 | |
| 3 | 52 | 9.4% |
| 5 | 16 | 2.9% |
| 2 | 10 | 1.8% |
| 7 | 6 | 1.1% |
| 9 | 3 | 0.5% |
| 4 | 2 | 0.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 28 | |
| G | 28 | |
| S | 28 | |
| M | 1 | 1.1% |
| K | 1 | 1.1% |
| O | 1 | 1.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| : | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 746 |
Math Symbol
| Value | Count | Frequency (%) |
| < | 110 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 22 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12105 | |
| Common | 1440 | 10.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4198 | |
| l | 2792 | |
| a | 1481 | 12.2% |
| s | 1380 | 11.4% |
| v | 1376 | 11.4% |
| t | 156 | 1.3% |
| f | 143 | 1.2% |
| c | 92 | 0.8% |
| m | 62 | 0.5% |
| r | 61 | 0.5% |
| Other values (18) | 364 | 3.0% |
Common
| Value | Count | Frequency (%) |
| 746 | ||
| 0 | 314 | |
| 1 | 152 | 10.6% |
| < | 110 | 7.6% |
| 3 | 52 | 3.6% |
| - | 22 | 1.5% |
| 5 | 16 | 1.1% |
| 2 | 10 | 0.7% |
| 7 | 6 | 0.4% |
| 9 | 3 | 0.2% |
| Other values (5) | 9 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13545 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4198 | |
| l | 2792 | |
| a | 1481 | 10.9% |
| s | 1380 | 10.2% |
| v | 1376 | 10.2% |
| 746 | 5.5% | |
| 0 | 314 | 2.3% |
| t | 156 | 1.2% |
| 1 | 152 | 1.1% |
| f | 143 | 1.1% |
| Other values (33) | 807 | 6.0% |
decimalLatitude
Text
Missing 
| Distinct | 10276 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 447917 |
| Missing (%) | 74.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 5.057648469 |
| Min length | 3 |
Unique
| Unique | 4988 ? |
|---|---|
| Unique (%) | 3.2% |
Sample
| 1st row | 5.98 |
|---|---|
| 2nd row | 34.68 |
| 3rd row | 31.5011 |
| 4th row | 29.37 |
| 5th row | 34.4863 |
| Value | Count | Frequency (%) |
| 5.3 | 1716 | 1.1% |
| 2.78 | 1090 | 0.7% |
| 5.67 | 1073 | 0.7% |
| 0.88 | 979 | 0.6% |
| 3.65 | 946 | 0.6% |
| 8.83 | 814 | 0.5% |
| 10.53 | 811 | 0.5% |
| 3.17 | 798 | 0.5% |
| 8.17 | 759 | 0.5% |
| 7.32 | 742 | 0.5% |
| Other values (9288) | 143806 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 153534 | |
| 3 | 86948 | |
| 2 | 77296 | |
| 1 | 68806 | |
| 5 | 67724 | |
| 8 | 61654 | |
| 7 | 57999 | 7.5% |
| 6 | 42953 | 5.5% |
| 0 | 42475 | 5.5% |
| 9 | 41167 | 5.3% |
| Other values (2) | 75965 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 585108 | |
| Other Punctuation | 153534 | 19.8% |
| Dash Punctuation | 37879 | 4.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 86948 | |
| 2 | 77296 | |
| 1 | 68806 | |
| 5 | 67724 | |
| 8 | 61654 | |
| 7 | 57999 | |
| 6 | 42953 | |
| 0 | 42475 | |
| 9 | 41167 | |
| 4 | 38086 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 153534 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37879 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 776521 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 153534 | |
| 3 | 86948 | |
| 2 | 77296 | |
| 1 | 68806 | |
| 5 | 67724 | |
| 8 | 61654 | |
| 7 | 57999 | 7.5% |
| 6 | 42953 | 5.5% |
| 0 | 42475 | 5.5% |
| 9 | 41167 | 5.3% |
| Other values (2) | 75965 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 776521 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 153534 | |
| 3 | 86948 | |
| 2 | 77296 | |
| 1 | 68806 | |
| 5 | 67724 | |
| 8 | 61654 | |
| 7 | 57999 | 7.5% |
| 6 | 42953 | 5.5% |
| 0 | 42475 | 5.5% |
| 9 | 41167 | 5.3% |
| Other values (2) | 75965 |
decimalLongitude
Text
Missing 
| Distinct | 11872 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 447917 |
| Missing (%) | 74.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 5.660244636 |
| Min length | 3 |
Unique
| Unique | 5896 ? |
|---|---|
| Unique (%) | 3.8% |
Sample
| 1st row | -61.43 |
|---|---|
| 2nd row | -76.7 |
| 3rd row | 65.8453 |
| 4th row | -94.82 |
| 5th row | 74.6026 |
| Value | Count | Frequency (%) |
| 66.22 | 1723 | 1.1% |
| 16.42 | 1090 | 0.7% |
| 127.68 | 955 | 0.6% |
| 0.2 | 930 | 0.6% |
| 70.5 | 790 | 0.5% |
| 71.95 | 739 | 0.5% |
| 79.62 | 722 | 0.5% |
| 0.22 | 681 | 0.4% |
| 0.97 | 651 | 0.4% |
| 66.18 | 629 | 0.4% |
| Other values (11081) | 144624 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 153534 | |
| - | 87899 | |
| 2 | 86728 | |
| 1 | 82470 | |
| 7 | 81082 | |
| 3 | 68918 | |
| 6 | 62220 | |
| 5 | 59306 | 6.8% |
| 8 | 58821 | 6.8% |
| 0 | 51094 | 5.9% |
| Other values (2) | 76968 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 627607 | |
| Other Punctuation | 153534 | 17.7% |
| Dash Punctuation | 87899 | 10.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 86728 | |
| 1 | 82470 | |
| 7 | 81082 | |
| 3 | 68918 | |
| 6 | 62220 | |
| 5 | 59306 | |
| 8 | 58821 | |
| 0 | 51094 | |
| 4 | 38863 | |
| 9 | 38105 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 153534 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 87899 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 869040 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 153534 | |
| - | 87899 | |
| 2 | 86728 | |
| 1 | 82470 | |
| 7 | 81082 | |
| 3 | 68918 | |
| 6 | 62220 | |
| 5 | 59306 | 6.8% |
| 8 | 58821 | 6.8% |
| 0 | 51094 | 5.9% |
| Other values (2) | 76968 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 869040 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 153534 | |
| - | 87899 | |
| 2 | 86728 | |
| 1 | 82470 | |
| 7 | 81082 | |
| 3 | 68918 | |
| 6 | 62220 | |
| 5 | 59306 | 6.8% |
| 8 | 58821 | 6.8% |
| 0 | 51094 | 5.9% |
| Other values (2) | 76968 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 468202 |
| Missing (%) | 77.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 22.96475771 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 133004 | |
| minutes | 133003 | |
| seconds | 133003 | |
| utm | 192 | < 0.1% |
| unknown | 53 | < 0.1% |
| decimal | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 665019 | |
| s | 399010 | |
| n | 266165 | 8.7% |
| 266007 | 8.7% | |
| M | 133195 | 4.4% |
| o | 133056 | 4.3% |
| D | 133004 | 4.3% |
| c | 133004 | 4.3% |
| g | 133004 | 4.3% |
| r | 133004 | 4.3% |
| Other values (12) | 665563 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2394385 | |
| Uppercase Letter | 399639 | 13.1% |
| Space Separator | 266007 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 665019 | |
| s | 399010 | |
| n | 266165 | |
| o | 133056 | 5.6% |
| c | 133004 | 5.6% |
| g | 133004 | 5.6% |
| r | 133004 | 5.6% |
| i | 133004 | 5.6% |
| d | 133004 | 5.6% |
| t | 133003 | 5.6% |
| Other values (6) | 133112 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 133195 | |
| D | 133004 | |
| S | 133003 | |
| U | 245 | 0.1% |
| T | 192 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 266007 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2794024 | |
| Common | 266007 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 665019 | |
| s | 399010 | |
| n | 266165 | |
| M | 133195 | 4.8% |
| o | 133056 | 4.8% |
| D | 133004 | 4.8% |
| c | 133004 | 4.8% |
| g | 133004 | 4.8% |
| r | 133004 | 4.8% |
| i | 133004 | 4.8% |
| Other values (11) | 532559 |
Common
| Value | Count | Frequency (%) |
| 266007 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3060031 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 665019 | |
| s | 399010 | |
| n | 266165 | 8.7% |
| 266007 | 8.7% | |
| M | 133195 | 4.4% |
| o | 133056 | 4.3% |
| D | 133004 | 4.3% |
| c | 133004 | 4.3% |
| g | 133004 | 4.3% |
| r | 133004 | 4.3% |
| Other values (12) | 665563 |
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 592196 |
| Missing (%) | 98.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 26 |
|---|---|
| Median length | 12 |
| Mean length | 10.66731496 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Google Earth |
|---|---|
| 2nd row | Google Earth |
| 3rd row | GPS |
| 4th row | Google Earth |
| 5th row | Google Earth |
| Value | Count | Frequency (%) |
| 7074 | ||
| earth | 7074 | |
| gps | 1418 | 8.3% |
| usgs | 530 | 3.1% |
| topoview | 530 | 3.1% |
| gazetteer | 137 | 0.8% |
| atlas | 42 | 0.2% |
| of | 42 | 0.2% |
| canada | 42 | 0.2% |
| 42 | 0.2% | |
| Other values (4) | 96 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 15334 | |
| G | 9159 | |
| e | 8096 | |
| t | 8000 | |
| 7772 | ||
| a | 7479 | |
| r | 7294 | |
| l | 7116 | |
| h | 7076 | |
| g | 7074 | |
| Other values (22) | 14326 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 69543 | |
| Uppercase Letter | 21368 | 21.6% |
| Space Separator | 7772 | 7.9% |
| Dash Punctuation | 42 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 15334 | |
| e | 8096 | |
| t | 8000 | |
| a | 7479 | |
| r | 7294 | |
| l | 7116 | |
| h | 7076 | |
| g | 7074 | |
| p | 586 | 0.8% |
| w | 530 | 0.8% |
| Other values (8) | 958 | 1.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 9159 | |
| E | 7074 | |
| S | 2478 | 11.6% |
| P | 1418 | 6.6% |
| V | 530 | 2.5% |
| U | 530 | 2.5% |
| A | 42 | 0.2% |
| C | 42 | 0.2% |
| T | 42 | 0.2% |
| I | 39 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 7772 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 42 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 90911 | |
| Common | 7815 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 15334 | |
| G | 9159 | |
| e | 8096 | |
| t | 8000 | |
| a | 7479 | |
| r | 7294 | |
| l | 7116 | |
| h | 7076 | |
| g | 7074 | |
| E | 7074 | |
| Other values (19) | 7209 |
Common
| Value | Count | Frequency (%) |
| 7772 | ||
| - | 42 | 0.5% |
| . | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 98726 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 15334 | |
| G | 9159 | |
| e | 8096 | |
| t | 8000 | |
| 7772 | ||
| a | 7479 | |
| r | 7294 | |
| l | 7116 | |
| h | 7076 | |
| g | 7074 | |
| Other values (22) | 14326 |
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 11.8% |
| Missing | 601383 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 35 |
| Mean length | 31.20588235 |
| Min length | 5 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 5.9% |
Sample
| 1st row | Garmin Etrex Vista HCX, Datum WGS84 |
|---|---|
| 2nd row | Garmin Etrex Vista HCX, Datum WGS84 |
| 3rd row | Garmin Etrex Vista HCX, Datum WGS84 |
| 4th row | Garmin Etrex Vista HCX, Datum WGS84 |
| 5th row | Garmin Etrex Vista HCX, Datum WGS84 |
| Value | Count | Frequency (%) |
| garmin | 54 | |
| etrex | 54 | |
| vista | 54 | |
| hcx | 54 | |
| datum | 54 | |
| wgs84 | 54 | |
| camp | 7 | 2.0% |
| coordinates | 7 | 2.0% |
| for | 6 | 1.7% |
| longitude | 2 | 0.6% |
| Other values (7) | 12 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 290 | 13.7% | |
| a | 184 | 8.7% |
| t | 175 | 8.2% |
| r | 132 | 6.2% |
| i | 123 | 5.8% |
| m | 118 | 5.6% |
| G | 108 | 5.1% |
| e | 73 | 3.4% |
| n | 67 | 3.2% |
| s | 62 | 2.9% |
| Other values (24) | 790 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1118 | |
| Uppercase Letter | 551 | |
| Space Separator | 290 | 13.7% |
| Decimal Number | 108 | 5.1% |
| Other Punctuation | 55 | 2.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 184 | |
| t | 175 | |
| r | 132 | |
| i | 123 | |
| m | 118 | |
| e | 73 | 6.5% |
| n | 67 | 6.0% |
| s | 62 | 5.5% |
| u | 58 | 5.2% |
| x | 56 | 5.0% |
| Other values (8) | 70 | 6.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 108 | |
| C | 61 | |
| S | 54 | |
| W | 54 | |
| D | 54 | |
| X | 54 | |
| H | 54 | |
| V | 54 | |
| E | 54 | |
| L | 2 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 54 | |
| 8 | 54 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 54 | |
| ; | 1 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| 290 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1669 | |
| Common | 453 | 21.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 184 | 11.0% |
| t | 175 | 10.5% |
| r | 132 | 7.9% |
| i | 123 | 7.4% |
| m | 118 | 7.1% |
| G | 108 | 6.5% |
| e | 73 | 4.4% |
| n | 67 | 4.0% |
| s | 62 | 3.7% |
| C | 61 | 3.7% |
| Other values (19) | 566 |
Common
| Value | Count | Frequency (%) |
| 290 | ||
| 4 | 54 | 11.9% |
| 8 | 54 | 11.9% |
| , | 54 | 11.9% |
| ; | 1 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2122 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 290 | 13.7% | |
| a | 184 | 8.7% |
| t | 175 | 8.2% |
| r | 132 | 6.2% |
| i | 123 | 5.8% |
| m | 118 | 5.6% |
| G | 108 | 5.1% |
| e | 73 | 3.4% |
| n | 67 | 3.2% |
| s | 62 | 2.9% |
| Other values (24) | 790 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 599947 |
| Missing (%) | 99.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.412234043 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | uncertain |
|---|---|
| 2nd row | uncertain |
| 3rd row | uncertain |
| 4th row | uncertain |
| 5th row | cf. |
| Value | Count | Frequency (%) |
| uncertain | 1355 | |
| cf | 147 | 9.8% |
| sp | 2 | 0.1% |
| near | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 2712 | |
| c | 1502 | |
| e | 1357 | |
| r | 1357 | |
| a | 1357 | |
| t | 1355 | |
| i | 1355 | |
| u | 1315 | |
| . | 149 | 1.2% |
| f | 147 | 1.2% |
| Other values (4) | 46 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12461 | |
| Other Punctuation | 149 | 1.2% |
| Uppercase Letter | 40 | 0.3% |
| Space Separator | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2712 | |
| c | 1502 | |
| e | 1357 | |
| r | 1357 | |
| a | 1357 | |
| t | 1355 | |
| i | 1355 | |
| u | 1315 | |
| f | 147 | 1.2% |
| s | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 149 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 40 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12501 | |
| Common | 151 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 2712 | |
| c | 1502 | |
| e | 1357 | |
| r | 1357 | |
| a | 1357 | |
| t | 1355 | |
| i | 1355 | |
| u | 1315 | |
| f | 147 | 1.2% |
| U | 40 | 0.3% |
| Other values (2) | 4 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| . | 149 | |
| 2 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12652 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 2712 | |
| c | 1502 | |
| e | 1357 | |
| r | 1357 | |
| a | 1357 | |
| t | 1355 | |
| i | 1355 | |
| u | 1315 | |
| . | 149 | 1.2% |
| f | 147 | 1.2% |
| Other values (4) | 46 | 0.4% |
typeStatus
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 597715 |
| Missing (%) | 99.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 4.176391863 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LECTOTYPE |
|---|---|
| 2nd row | TYPE |
| 3rd row | TYPE |
| 4th row | TYPE |
| 5th row | TYPE |
| Value | Count | Frequency (%) |
| type | 3565 | |
| syntype | 80 | 2.1% |
| lectotype | 67 | 1.8% |
| neotype | 12 | 0.3% |
| holotype | 12 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 3816 | |
| E | 3815 | |
| T | 3803 | |
| P | 3736 | |
| O | 103 | 0.7% |
| N | 92 | 0.6% |
| S | 80 | 0.5% |
| L | 79 | 0.5% |
| C | 67 | 0.4% |
| H | 12 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15603 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 3816 | |
| E | 3815 | |
| T | 3803 | |
| P | 3736 | |
| O | 103 | 0.7% |
| N | 92 | 0.6% |
| S | 80 | 0.5% |
| L | 79 | 0.5% |
| C | 67 | 0.4% |
| H | 12 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15603 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 3816 | |
| E | 3815 | |
| T | 3803 | |
| P | 3736 | |
| O | 103 | 0.7% |
| N | 92 | 0.6% |
| S | 80 | 0.5% |
| L | 79 | 0.5% |
| C | 67 | 0.4% |
| H | 12 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15603 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 3816 | |
| E | 3815 | |
| T | 3803 | |
| P | 3736 | |
| O | 103 | 0.7% |
| N | 92 | 0.6% |
| S | 80 | 0.5% |
| L | 79 | 0.5% |
| C | 67 | 0.4% |
| H | 12 | 0.1% |
identifiedBy
Text
Missing 
| Distinct | 95 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 593267 |
| Missing (%) | 98.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 132 |
|---|---|
| Median length | 124 |
| Mean length | 94.36840176 |
| Min length | 10 |
Unique
| Unique | 21 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | O'Neill, Jennifer K., Fort Hayes State University |
|---|---|
| 2nd row | Gardner, Alfred L., Curator (USGS), United States Geological Survey (UNITED STATES) |
| 3rd row | Woodman, Neal, (USGS), United States Geological Survey (UNITED STATES) |
| 4th row | Lunde, Darrin P., Collections Manager (MAM), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 5th row | Reeder, DeeAnn M., Bucknell University (UNITED STATES) |
| Value | Count | Frequency (%) |
| states | 8033 | 7.9% |
| united | 8033 | 7.9% |
| of | 5420 | 5.3% |
| museum | 5255 | 5.2% |
| natural | 5077 | 5.0% |
| history | 5077 | 5.0% |
| national | 5064 | 5.0% |
| smithsonian | 5007 | 4.9% |
| institution | 5007 | 4.9% |
| 4859 | 4.8% | |
| Other values (272) | 44753 |
Most occurring characters
| Value | Count | Frequency (%) |
| 93401 | 12.1% | |
| t | 49895 | 6.5% |
| o | 47659 | 6.2% |
| i | 45409 | 5.9% |
| a | 41696 | 5.4% |
| e | 39504 | 5.1% |
| n | 38647 | 5.0% |
| s | 36580 | 4.7% |
| r | 29451 | 3.8% |
| u | 25575 | 3.3% |
| Other values (48) | 324494 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 444828 | |
| Uppercase Letter | 174361 | 22.6% |
| Space Separator | 93401 | 12.1% |
| Other Punctuation | 28613 | 3.7% |
| Open Punctuation | 13070 | 1.7% |
| Close Punctuation | 13070 | 1.7% |
| Dash Punctuation | 4968 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 49895 | |
| o | 47659 | |
| i | 45409 | |
| a | 41696 | |
| e | 39504 | |
| n | 38647 | |
| s | 36580 | |
| r | 29451 | |
| u | 25575 | 5.7% |
| l | 25043 | 5.6% |
| Other values (15) | 65369 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 24263 | |
| T | 20751 | |
| M | 20290 | |
| N | 18192 | |
| E | 15013 | |
| A | 13295 | |
| I | 12131 | |
| U | 9985 | |
| D | 9079 | 5.2% |
| H | 8379 | 4.8% |
| Other values (14) | 22983 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 22186 | |
| . | 6353 | 22.2% |
| ' | 69 | 0.2% |
| ; | 4 | < 0.1% |
| & | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 93401 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 13070 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13070 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4968 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 619189 | |
| Common | 153122 | 19.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 49895 | 8.1% |
| o | 47659 | 7.7% |
| i | 45409 | 7.3% |
| a | 41696 | 6.7% |
| e | 39504 | 6.4% |
| n | 38647 | 6.2% |
| s | 36580 | 5.9% |
| r | 29451 | 4.8% |
| u | 25575 | 4.1% |
| l | 25043 | 4.0% |
| Other values (39) | 239730 |
Common
| Value | Count | Frequency (%) |
| 93401 | ||
| , | 22186 | 14.5% |
| ( | 13070 | 8.5% |
| ) | 13070 | 8.5% |
| . | 6353 | 4.1% |
| - | 4968 | 3.2% |
| ' | 69 | < 0.1% |
| ; | 4 | < 0.1% |
| & | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 772311 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 93401 | 12.1% | |
| t | 49895 | 6.5% |
| o | 47659 | 6.2% |
| i | 45409 | 5.9% |
| a | 41696 | 5.4% |
| e | 39504 | 5.1% |
| n | 38647 | 5.0% |
| s | 36580 | 4.7% |
| r | 29451 | 3.8% |
| u | 25575 | 3.3% |
| Other values (48) | 324494 |
| Distinct | 6815 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.999645856 |
| Min length | 2 |
Unique
| Unique | 793 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2433573 |
|---|---|
| 2nd row | 2438621 |
| 3rd row | 2433177 |
| 4th row | 2438034 |
| 5th row | 2440447 |
| Value | Count | Frequency (%) |
| 2437967 | 14724 | 2.4% |
| 2440447 | 11867 | 2.0% |
| 2438904 | 8874 | 1.5% |
| 2433176 | 8329 | 1.4% |
| 2438019 | 7347 | 1.2% |
| 2438655 | 6840 | 1.1% |
| 2433272 | 5470 | 0.9% |
| 2439270 | 5412 | 0.9% |
| 2437782 | 5206 | 0.9% |
| 4264939 | 4687 | 0.8% |
| Other values (6805) | 522695 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 695586 | |
| 4 | 682390 | |
| 3 | 540567 | |
| 6 | 455861 | |
| 7 | 389135 | |
| 1 | 323452 | |
| 8 | 317316 | |
| 9 | 293353 | |
| 5 | 267471 | 6.4% |
| 0 | 244813 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4209944 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 695586 | |
| 4 | 682390 | |
| 3 | 540567 | |
| 6 | 455861 | |
| 7 | 389135 | |
| 1 | 323452 | |
| 8 | 317316 | |
| 9 | 293353 | |
| 5 | 267471 | 6.4% |
| 0 | 244813 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4209944 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 695586 | |
| 4 | 682390 | |
| 3 | 540567 | |
| 6 | 455861 | |
| 7 | 389135 | |
| 1 | 323452 | |
| 8 | 317316 | |
| 9 | 293353 | |
| 5 | 267471 | 6.4% |
| 0 | 244813 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4209944 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 695586 | |
| 4 | 682390 | |
| 3 | 540567 | |
| 6 | 455861 | |
| 7 | 389135 | |
| 1 | 323452 | |
| 8 | 317316 | |
| 9 | 293353 | |
| 5 | 267471 | 6.4% |
| 0 | 244813 | 5.8% |
scientificName
Text
| Distinct | 7326 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 147 |
|---|---|
| Median length | 72 |
| Mean length | 35.02832483 |
| Min length | 7 |
Unique
| Unique | 849 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Potos flavus (Schreber, 1774) |
|---|---|
| 2nd row | Microtus longicaudus longicaudus |
| 3rd row | Carollia brevicaudum (Schinz, 1821) |
| 4th row | Peromyscus mexicanus totontepecus Merriam, 1898 |
| 5th row | Tursiops truncatus (Montagu, 1821) |
| Value | Count | Frequency (%) |
| linnaeus | 52995 | 2.1% |
| 1758 | 48641 | 1.9% |
| thomas | 44736 | 1.8% |
| peromyscus | 38753 | 1.5% |
| merriam | 29181 | 1.2% |
| 25993 | 1.0% | |
| rattus | 21929 | 0.9% |
| 1821 | 21801 | 0.9% |
| microtus | 19877 | 0.8% |
| j.a.allen | 18118 | 0.7% |
| Other values (6496) | 2183955 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1904528 | 9.0% | |
| s | 1657434 | 7.9% |
| a | 1400545 | 6.6% |
| i | 1320831 | 6.3% |
| e | 1211612 | 5.8% |
| r | 1087984 | 5.2% |
| u | 1063056 | 5.0% |
| o | 1056373 | 5.0% |
| n | 919028 | 4.4% |
| l | 817723 | 3.9% |
| Other values (70) | 8628707 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14532612 | |
| Decimal Number | 2105528 | 10.0% |
| Space Separator | 1904528 | 9.0% |
| Uppercase Letter | 1252548 | 5.9% |
| Other Punctuation | 638107 | 3.0% |
| Close Punctuation | 313875 | 1.5% |
| Open Punctuation | 313875 | 1.5% |
| Dash Punctuation | 6748 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1657434 | |
| a | 1400545 | |
| i | 1320831 | |
| e | 1211612 | 8.3% |
| r | 1087984 | 7.5% |
| u | 1063056 | 7.3% |
| o | 1056373 | 7.3% |
| n | 919028 | 6.3% |
| l | 817723 | 5.6% |
| t | 702630 | 4.8% |
| Other values (24) | 3295396 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 175044 | |
| P | 119561 | |
| T | 111746 | 8.9% |
| S | 105807 | 8.4% |
| L | 95529 | 7.6% |
| A | 94300 | 7.5% |
| G | 80592 | 6.4% |
| C | 73937 | 5.9% |
| B | 55966 | 4.5% |
| R | 50699 | 4.0% |
| Other values (18) | 289367 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 644905 | |
| 8 | 441758 | |
| 9 | 227519 | 10.8% |
| 7 | 162226 | 7.7% |
| 5 | 138662 | 6.6% |
| 0 | 127816 | 6.1% |
| 4 | 100538 | 4.8% |
| 3 | 96034 | 4.6% |
| 2 | 89296 | 4.2% |
| 6 | 76774 | 3.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 528229 | |
| . | 83107 | 13.0% |
| & | 25993 | 4.1% |
| ' | 778 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1904528 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 313875 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 313875 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6748 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15785160 | |
| Common | 5282661 | 25.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 1657434 | 10.5% |
| a | 1400545 | 8.9% |
| i | 1320831 | 8.4% |
| e | 1211612 | 7.7% |
| r | 1087984 | 6.9% |
| u | 1063056 | 6.7% |
| o | 1056373 | 6.7% |
| n | 919028 | 5.8% |
| l | 817723 | 5.2% |
| t | 702630 | 4.5% |
| Other values (52) | 4547944 |
Common
| Value | Count | Frequency (%) |
| 1904528 | ||
| 1 | 644905 | 12.2% |
| , | 528229 | 10.0% |
| 8 | 441758 | 8.4% |
| ) | 313875 | 5.9% |
| ( | 313875 | 5.9% |
| 9 | 227519 | 4.3% |
| 7 | 162226 | 3.1% |
| 5 | 138662 | 2.6% |
| 0 | 127816 | 2.4% |
| Other values (8) | 479268 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21054857 | |
| None | 12964 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1904528 | 9.0% | |
| s | 1657434 | 7.9% |
| a | 1400545 | 6.7% |
| i | 1320831 | 6.3% |
| e | 1211612 | 5.8% |
| r | 1087984 | 5.2% |
| u | 1063056 | 5.0% |
| o | 1056373 | 5.0% |
| n | 919028 | 4.4% |
| l | 817723 | 3.9% |
| Other values (60) | 8615743 |
None
| Value | Count | Frequency (%) |
| ü | 5095 | |
| É | 4244 | |
| é | 1615 | 12.5% |
| è | 1387 | 10.7% |
| ö | 331 | 2.6% |
| á | 96 | 0.7% |
| ñ | 78 | 0.6% |
| í | 70 | 0.5% |
| Ä | 24 | 0.2% |
| ä | 24 | 0.2% |
| Distinct | 253 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 121 |
|---|---|
| Median length | 113 |
| Mean length | 90.64064651 |
| Min length | 11 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Carnivora, Caniformia, Procyonidae |
|---|---|
| 2nd row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Rodentia, Myomorpha, Cricetidae, Arvicolinae |
| 3rd row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Chiroptera, Phyllostomidae, Carolliinae |
| 4th row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Rodentia, Myomorpha, Cricetidae, Neotominae |
| 5th row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Cetacea, Odontoceti, Delphinidae |
| Value | Count | Frequency (%) |
| animalia | 601442 | |
| vertebrata | 601442 | |
| chordata | 601442 | |
| mammalia | 601441 | |
| eutheria | 593341 | |
| rodentia | 297636 | 5.9% |
| myomorpha | 209417 | 4.1% |
| chiroptera | 129086 | 2.5% |
| cricetidae | 107243 | 2.1% |
| muridae | 93911 | 1.9% |
| Other values (328) | 1234181 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8383067 | |
| i | 4797600 | 8.8% |
| , | 4469138 | 8.2% |
| 4469138 | 8.2% | |
| e | 4068524 | 7.5% |
| r | 4037606 | 7.4% |
| t | 3533330 | 6.5% |
| o | 2704288 | 5.0% |
| m | 2453478 | 4.5% |
| h | 1861673 | 3.4% |
| Other values (38) | 13737431 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40506415 | |
| Uppercase Letter | 5070582 | 9.3% |
| Other Punctuation | 4469138 | 8.2% |
| Space Separator | 4469138 | 8.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8383067 | |
| i | 4797600 | |
| e | 4068524 | |
| r | 4037606 | |
| t | 3533330 | |
| o | 2704288 | 6.7% |
| m | 2453478 | 6.1% |
| h | 1861673 | 4.6% |
| n | 1678993 | 4.1% |
| l | 1675363 | 4.1% |
| Other values (14) | 5312493 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1067853 | |
| M | 1065447 | |
| A | 654487 | |
| V | 641586 | |
| E | 615945 | |
| R | 302881 | 6.0% |
| S | 237180 | 4.7% |
| P | 112443 | 2.2% |
| D | 65158 | 1.3% |
| N | 62146 | 1.2% |
| Other values (12) | 245456 | 4.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4469138 |
Space Separator
| Value | Count | Frequency (%) |
| 4469138 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45576997 | |
| Common | 8938276 | 16.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8383067 | |
| i | 4797600 | |
| e | 4068524 | 8.9% |
| r | 4037606 | 8.9% |
| t | 3533330 | 7.8% |
| o | 2704288 | 5.9% |
| m | 2453478 | 5.4% |
| h | 1861673 | 4.1% |
| n | 1678993 | 3.7% |
| l | 1675363 | 3.7% |
| Other values (36) | 10383075 |
Common
| Value | Count | Frequency (%) |
| , | 4469138 | |
| 4469138 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 54515273 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8383067 | |
| i | 4797600 | 8.8% |
| , | 4469138 | 8.2% |
| 4469138 | 8.2% | |
| e | 4068524 | 7.5% |
| r | 4037606 | 7.4% |
| t | 3533330 | 6.5% |
| o | 2704288 | 5.0% |
| m | 2453478 | 4.5% |
| h | 1861673 | 3.4% |
| Other values (38) | 13737431 |
kingdom
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1202902 | |
| a | 1202902 | |
| A | 601451 | |
| n | 601451 | |
| m | 601451 | |
| l | 601451 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4210157 | |
| Uppercase Letter | 601451 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1202902 | |
| a | 1202902 | |
| n | 601451 | |
| m | 601451 | |
| l | 601451 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 601451 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4811608 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1202902 | |
| a | 1202902 | |
| A | 601451 | |
| n | 601451 | |
| m | 601451 | |
| l | 601451 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4811608 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1202902 | |
| a | 1202902 | |
| A | 601451 | |
| n | 601451 | |
| m | 601451 | |
| l | 601451 |
phylum
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Chordata |
| 3rd row | Chordata |
| 4th row | Chordata |
| 5th row | Chordata |
| Value | Count | Frequency (%) |
| chordata | 601449 | |
| mollusca | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1202900 | |
| o | 601451 | |
| C | 601449 | |
| h | 601449 | |
| r | 601449 | |
| d | 601449 | |
| t | 601449 | |
| l | 4 | < 0.1% |
| M | 2 | < 0.1% |
| u | 2 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4210157 | |
| Uppercase Letter | 601451 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1202900 | |
| o | 601451 | |
| h | 601449 | |
| r | 601449 | |
| d | 601449 | |
| t | 601449 | |
| l | 4 | < 0.1% |
| u | 2 | < 0.1% |
| s | 2 | < 0.1% |
| c | 2 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 601449 | |
| M | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4811608 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1202900 | |
| o | 601451 | |
| C | 601449 | |
| h | 601449 | |
| r | 601449 | |
| d | 601449 | |
| t | 601449 | |
| l | 4 | < 0.1% |
| M | 2 | < 0.1% |
| u | 2 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4811608 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1202900 | |
| o | 601451 | |
| C | 601449 | |
| h | 601449 | |
| r | 601449 | |
| d | 601449 | |
| t | 601449 | |
| l | 4 | < 0.1% |
| M | 2 | < 0.1% |
| u | 2 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
class
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 8.000006651 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mammalia |
|---|---|
| 2nd row | Mammalia |
| 3rd row | Mammalia |
| 4th row | Mammalia |
| 5th row | Mammalia |
| Value | Count | Frequency (%) |
| mammalia | 601448 | |
| gastropoda | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1804348 | |
| m | 1202896 | |
| M | 601448 | 12.5% |
| l | 601448 | 12.5% |
| i | 601448 | 12.5% |
| o | 4 | < 0.1% |
| G | 2 | < 0.1% |
| s | 2 | < 0.1% |
| t | 2 | < 0.1% |
| r | 2 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4210154 | |
| Uppercase Letter | 601450 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1804348 | |
| m | 1202896 | |
| l | 601448 | 14.3% |
| i | 601448 | 14.3% |
| o | 4 | < 0.1% |
| s | 2 | < 0.1% |
| t | 2 | < 0.1% |
| r | 2 | < 0.1% |
| p | 2 | < 0.1% |
| d | 2 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 601448 | |
| G | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4811604 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1804348 | |
| m | 1202896 | |
| M | 601448 | 12.5% |
| l | 601448 | 12.5% |
| i | 601448 | 12.5% |
| o | 4 | < 0.1% |
| G | 2 | < 0.1% |
| s | 2 | < 0.1% |
| t | 2 | < 0.1% |
| r | 2 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4811604 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1804348 | |
| m | 1202896 | |
| M | 601448 | 12.5% |
| l | 601448 | 12.5% |
| i | 601448 | 12.5% |
| o | 4 | < 0.1% |
| G | 2 | < 0.1% |
| s | 2 | < 0.1% |
| t | 2 | < 0.1% |
| r | 2 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
order
Text
| Distinct | 29 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 8 |
| Mean length | 8.868951264 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Carnivora |
|---|---|
| 2nd row | Rodentia |
| 3rd row | Chiroptera |
| 4th row | Rodentia |
| 5th row | Cetacea |
| Value | Count | Frequency (%) |
| rodentia | 297636 | |
| chiroptera | 129084 | |
| cetacea | 47588 | 7.9% |
| carnivora | 47294 | 7.9% |
| soricomorpha | 30383 | 5.1% |
| lagomorpha | 11977 | 2.0% |
| artiodactyla | 11375 | 1.9% |
| primates | 10781 | 1.8% |
| didelphimorphia | 5645 | 0.9% |
| diprotodontia | 1652 | 0.3% |
| Other values (19) | 8033 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 725656 | |
| o | 618974 | |
| i | 555654 | |
| e | 546103 | |
| t | 514236 | |
| r | 462546 | |
| n | 351518 | |
| d | 320914 | |
| R | 297636 | |
| C | 224385 | 4.2% |
| Other values (22) | 716591 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4732765 | |
| Uppercase Letter | 601448 | 11.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 725656 | |
| o | 618974 | |
| i | 555654 | |
| e | 546103 | |
| t | 514236 | |
| r | 462546 | |
| n | 351518 | |
| d | 320914 | |
| p | 186078 | 3.9% |
| h | 184415 | 3.9% |
| Other values (10) | 266671 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 297636 | |
| C | 224385 | |
| S | 32307 | 5.4% |
| P | 12509 | 2.1% |
| A | 12049 | 2.0% |
| L | 11977 | 2.0% |
| D | 7786 | 1.3% |
| M | 1503 | 0.2% |
| E | 940 | 0.2% |
| H | 341 | 0.1% |
| Other values (2) | 15 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5334213 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 725656 | |
| o | 618974 | |
| i | 555654 | |
| e | 546103 | |
| t | 514236 | |
| r | 462546 | |
| n | 351518 | |
| d | 320914 | |
| R | 297636 | |
| C | 224385 | 4.2% |
| Other values (22) | 716591 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5334213 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 725656 | |
| o | 618974 | |
| i | 555654 | |
| e | 546103 | |
| t | 514236 | |
| r | 462546 | |
| n | 351518 | |
| d | 320914 | |
| R | 297636 | |
| C | 224385 | 4.2% |
| Other values (22) | 716591 |
family
Text
| Distinct | 158 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1158 |
| Missing (%) | 0.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 16 |
| Mean length | 10.24363436 |
| Min length | 6 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Procyonidae |
|---|---|
| 2nd row | Cricetidae |
| 3rd row | Phyllostomidae |
| 4th row | Cricetidae |
| 5th row | Delphinidae |
| Value | Count | Frequency (%) |
| cricetidae | 107243 | |
| muridae | 93911 | |
| phyllostomidae | 55530 | 9.3% |
| sciuridae | 46130 | 7.7% |
| soricidae | 27470 | 4.6% |
| delphinidae | 23642 | 3.9% |
| vespertilionidae | 22260 | 3.7% |
| heteromyidae | 19997 | 3.3% |
| molossidae | 13560 | 2.3% |
| canidae | 12559 | 2.1% |
| Other values (148) | 177991 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 944314 | |
| i | 915778 | |
| a | 664057 | |
| d | 635413 | |
| r | 412407 | 6.7% |
| o | 362011 | 5.9% |
| t | 276360 | 4.5% |
| l | 229210 | 3.7% |
| c | 221440 | 3.6% |
| u | 159982 | 2.6% |
| Other values (32) | 1328210 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5548889 | |
| Uppercase Letter | 600293 | 9.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 944314 | |
| i | 915778 | |
| a | 664057 | |
| d | 635413 | |
| r | 412407 | |
| o | 362011 | 6.5% |
| t | 276360 | 5.0% |
| l | 229210 | 4.1% |
| c | 221440 | 4.0% |
| u | 159982 | 2.9% |
| Other values (12) | 727917 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 134036 | |
| M | 133645 | |
| P | 81100 | |
| S | 74875 | |
| D | 35147 | 5.9% |
| H | 30149 | 5.0% |
| V | 23469 | 3.9% |
| B | 14305 | 2.4% |
| G | 12230 | 2.0% |
| E | 11823 | 2.0% |
| Other values (10) | 49514 | 8.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6149182 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 944314 | |
| i | 915778 | |
| a | 664057 | |
| d | 635413 | |
| r | 412407 | 6.7% |
| o | 362011 | 5.9% |
| t | 276360 | 4.5% |
| l | 229210 | 3.7% |
| c | 221440 | 3.6% |
| u | 159982 | 2.6% |
| Other values (32) | 1328210 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6149182 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 944314 | |
| i | 915778 | |
| a | 664057 | |
| d | 635413 | |
| r | 412407 | 6.7% |
| o | 362011 | 5.9% |
| t | 276360 | 4.5% |
| l | 229210 | 3.7% |
| c | 221440 | 3.6% |
| u | 159982 | 2.6% |
| Other values (32) | 1328210 |
genus
Text
| Distinct | 1129 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1999 |
| Missing (%) | 0.3% |
| Memory size | 4.6 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 8.505269813 |
| Min length | 2 |
Unique
| Unique | 62 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Potos |
|---|---|
| 2nd row | Microtus |
| 3rd row | Carollia |
| 4th row | Peromyscus |
| 5th row | Tursiops |
| Value | Count | Frequency (%) |
| peromyscus | 38753 | 6.5% |
| microtus | 19877 | 3.3% |
| rattus | 16463 | 2.7% |
| sorex | 15826 | 2.6% |
| artibeus | 12467 | 2.1% |
| carollia | 12281 | 2.0% |
| tursiops | 11894 | 2.0% |
| tamias | 11871 | 2.0% |
| mastomys | 11447 | 1.9% |
| mus | 10554 | 1.8% |
| Other values (1119) | 438019 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 603875 | 11.8% |
| o | 509579 | 10.0% |
| a | 348715 | 6.8% |
| r | 348305 | 6.8% |
| u | 337607 | 6.6% |
| i | 326618 | 6.4% |
| e | 313769 | 6.2% |
| t | 248513 | 4.9% |
| l | 221347 | 4.3% |
| y | 216014 | 4.2% |
| Other values (40) | 1624159 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4499049 | |
| Uppercase Letter | 599452 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 603875 | |
| o | 509579 | |
| a | 348715 | 7.8% |
| r | 348305 | 7.7% |
| u | 337607 | 7.5% |
| i | 326618 | 7.3% |
| e | 313769 | 7.0% |
| t | 248513 | 5.5% |
| l | 221347 | 4.9% |
| y | 216014 | 4.8% |
| Other values (16) | 1024707 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 103829 | |
| P | 84771 | |
| C | 58032 | |
| S | 54440 | |
| T | 51640 | |
| A | 33284 | 5.6% |
| R | 31059 | 5.2% |
| G | 28169 | 4.7% |
| L | 22997 | 3.8% |
| D | 21683 | 3.6% |
| Other values (14) | 109548 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5098501 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 603875 | 11.8% |
| o | 509579 | 10.0% |
| a | 348715 | 6.8% |
| r | 348305 | 6.8% |
| u | 337607 | 6.6% |
| i | 326618 | 6.4% |
| e | 313769 | 6.2% |
| t | 248513 | 4.9% |
| l | 221347 | 4.3% |
| y | 216014 | 4.2% |
| Other values (40) | 1624159 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5098501 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 603875 | 11.8% |
| o | 509579 | 10.0% |
| a | 348715 | 6.8% |
| r | 348305 | 6.8% |
| u | 337607 | 6.6% |
| i | 326618 | 6.4% |
| e | 313769 | 6.2% |
| t | 248513 | 4.9% |
| l | 221347 | 4.3% |
| y | 216014 | 4.2% |
| Other values (40) | 1624159 |
genericName
Text
| Distinct | 1115 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 2002 |
| Missing (%) | 0.3% |
| Memory size | 4.6 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 8.504708491 |
| Min length | 2 |
Unique
| Unique | 64 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Potos |
|---|---|
| 2nd row | Microtus |
| 3rd row | Carollia |
| 4th row | Peromyscus |
| 5th row | Tursiops |
| Value | Count | Frequency (%) |
| peromyscus | 38753 | 6.5% |
| microtus | 19877 | 3.3% |
| rattus | 16463 | 2.7% |
| sorex | 15826 | 2.6% |
| artibeus | 12470 | 2.1% |
| carollia | 12281 | 2.0% |
| tursiops | 11894 | 2.0% |
| tamias | 11871 | 2.0% |
| mastomys | 11447 | 1.9% |
| mus | 10554 | 1.8% |
| Other values (1105) | 438013 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 603400 | 11.8% |
| o | 512945 | 10.1% |
| r | 348628 | 6.8% |
| a | 347937 | 6.8% |
| u | 335989 | 6.6% |
| i | 330068 | 6.5% |
| e | 312892 | 6.1% |
| t | 245783 | 4.8% |
| l | 219644 | 4.3% |
| m | 215952 | 4.2% |
| Other values (40) | 1624901 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4498690 | |
| Uppercase Letter | 599449 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 603400 | |
| o | 512945 | |
| r | 348628 | 7.7% |
| a | 347937 | 7.7% |
| u | 335989 | 7.5% |
| i | 330068 | 7.3% |
| e | 312892 | 7.0% |
| t | 245783 | 5.5% |
| l | 219644 | 4.9% |
| m | 215952 | 4.8% |
| Other values (16) | 1025452 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 102791 | |
| P | 84544 | |
| C | 58079 | |
| S | 54592 | |
| T | 51641 | |
| A | 32571 | 5.4% |
| R | 31084 | 5.2% |
| G | 28179 | 4.7% |
| L | 23170 | 3.9% |
| N | 23069 | 3.8% |
| Other values (14) | 109729 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5098139 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 603400 | 11.8% |
| o | 512945 | 10.1% |
| r | 348628 | 6.8% |
| a | 347937 | 6.8% |
| u | 335989 | 6.6% |
| i | 330068 | 6.5% |
| e | 312892 | 6.1% |
| t | 245783 | 4.8% |
| l | 219644 | 4.3% |
| m | 215952 | 4.2% |
| Other values (40) | 1624901 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5098139 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 603400 | 11.8% |
| o | 512945 | 10.1% |
| r | 348628 | 6.8% |
| a | 347937 | 6.8% |
| u | 335989 | 6.6% |
| i | 330068 | 6.5% |
| e | 312892 | 6.1% |
| t | 245783 | 4.8% |
| l | 219644 | 4.3% |
| m | 215952 | 4.2% |
| Other values (40) | 1624901 |
specificEpithet
Text
Missing 
| Distinct | 2771 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 29657 |
| Missing (%) | 4.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 14 |
| Mean length | 8.673424345 |
| Min length | 2 |
Unique
| Unique | 258 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | flavus |
|---|---|
| 2nd row | longicaudus |
| 3rd row | brevicaudum |
| 4th row | mexicanus |
| 5th row | truncatus |
| Value | Count | Frequency (%) |
| maniculatus | 15647 | 2.7% |
| truncatus | 11873 | 2.1% |
| musculus | 8519 | 1.5% |
| perspicillata | 8339 | 1.5% |
| leucopus | 7382 | 1.3% |
| pennsylvanicus | 6799 | 1.2% |
| jamaicensis | 5581 | 1.0% |
| brevicauda | 5546 | 1.0% |
| rattus | 5466 | 1.0% |
| cinereus | 4761 | 0.8% |
| Other values (2761) | 491881 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 572805 | |
| i | 552816 | |
| a | 502230 | |
| u | 462190 | |
| e | 328983 | 6.6% |
| r | 327562 | 6.6% |
| n | 325551 | 6.6% |
| l | 286376 | 5.8% |
| t | 270290 | 5.5% |
| c | 258935 | 5.2% |
| Other values (16) | 1071674 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4959412 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 572805 | |
| i | 552816 | |
| a | 502230 | |
| u | 462190 | |
| e | 328983 | 6.6% |
| r | 327562 | 6.6% |
| n | 325551 | 6.6% |
| l | 286376 | 5.8% |
| t | 270290 | 5.5% |
| c | 258935 | 5.2% |
| Other values (16) | 1071674 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4959412 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 572805 | |
| i | 552816 | |
| a | 502230 | |
| u | 462190 | |
| e | 328983 | 6.6% |
| r | 327562 | 6.6% |
| n | 325551 | 6.6% |
| l | 286376 | 5.8% |
| t | 270290 | 5.5% |
| c | 258935 | 5.2% |
| Other values (16) | 1071674 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4959412 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 572805 | |
| i | 552816 | |
| a | 502230 | |
| u | 462190 | |
| e | 328983 | 6.6% |
| r | 327562 | 6.6% |
| n | 325551 | 6.6% |
| l | 286376 | 5.8% |
| t | 270290 | 5.5% |
| c | 258935 | 5.2% |
| Other values (16) | 1071674 |
Missing 
| Distinct | 2443 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 386527 |
| Missing (%) | 64.3% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 15 |
| Mean length | 8.768327409 |
| Min length | 3 |
Unique
| Unique | 226 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | longicaudus |
|---|---|
| 2nd row | totontepecus |
| 3rd row | marinensis |
| 4th row | bairdii |
| 5th row | merriami |
| Value | Count | Frequency (%) |
| domesticus | 4357 | 2.0% |
| pennsylvanicus | 4127 | 1.9% |
| talpoides | 3712 | 1.7% |
| cinereus | 3602 | 1.7% |
| trowbridgii | 2145 | 1.0% |
| merriami | 2051 | 1.0% |
| lestes | 1946 | 0.9% |
| panamensis | 1556 | 0.7% |
| personatus | 1522 | 0.7% |
| mexicana | 1479 | 0.7% |
| Other values (2433) | 188427 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 232184 | |
| i | 220508 | |
| a | 170831 | |
| e | 153788 | 8.2% |
| n | 142641 | 7.6% |
| u | 141648 | 7.5% |
| r | 121518 | 6.4% |
| o | 101734 | 5.4% |
| l | 100611 | 5.3% |
| c | 89089 | 4.7% |
| Other values (16) | 409972 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1884524 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 232184 | |
| i | 220508 | |
| a | 170831 | |
| e | 153788 | 8.2% |
| n | 142641 | 7.6% |
| u | 141648 | 7.5% |
| r | 121518 | 6.4% |
| o | 101734 | 5.4% |
| l | 100611 | 5.3% |
| c | 89089 | 4.7% |
| Other values (16) | 409972 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1884524 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 232184 | |
| i | 220508 | |
| a | 170831 | |
| e | 153788 | 8.2% |
| n | 142641 | 7.6% |
| u | 141648 | 7.5% |
| r | 121518 | 6.4% |
| o | 101734 | 5.4% |
| l | 100611 | 5.3% |
| c | 89089 | 4.7% |
| Other values (16) | 409972 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1884524 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 232184 | |
| i | 220508 | |
| a | 170831 | |
| e | 153788 | 8.2% |
| n | 142641 | 7.6% |
| u | 141648 | 7.5% |
| r | 121518 | 6.4% |
| o | 101734 | 5.4% |
| l | 100611 | 5.3% |
| c | 89089 | 4.7% |
| Other values (16) | 409972 |
taxonRank
Text
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 7.974819229 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SPECIES |
|---|---|
| 2nd row | SUBSPECIES |
| 3rd row | SPECIES |
| 4th row | SUBSPECIES |
| 5th row | SPECIES |
| Value | Count | Frequency (%) |
| species | 356873 | |
| subspecies | 214924 | |
| genus | 27655 | 4.6% |
| order | 1157 | 0.2% |
| family | 841 | 0.1% |
| phylum | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 1386173 | |
| E | 1172406 | |
| I | 572638 | |
| P | 571798 | |
| C | 571797 | |
| U | 242580 | 5.1% |
| B | 214924 | 4.5% |
| G | 27655 | 0.6% |
| N | 27655 | 0.6% |
| R | 2314 | < 0.1% |
| Other values (8) | 6523 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4796463 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1386173 | |
| E | 1172406 | |
| I | 572638 | |
| P | 571798 | |
| C | 571797 | |
| U | 242580 | 5.1% |
| B | 214924 | 4.5% |
| G | 27655 | 0.6% |
| N | 27655 | 0.6% |
| R | 2314 | < 0.1% |
| Other values (8) | 6523 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4796463 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 1386173 | |
| E | 1172406 | |
| I | 572638 | |
| P | 571798 | |
| C | 571797 | |
| U | 242580 | 5.1% |
| B | 214924 | 4.5% |
| G | 27655 | 0.6% |
| N | 27655 | 0.6% |
| R | 2314 | < 0.1% |
| Other values (8) | 6523 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4796463 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 1386173 | |
| E | 1172406 | |
| I | 572638 | |
| P | 571798 | |
| C | 571797 | |
| U | 242580 | 5.1% |
| B | 214924 | 4.5% |
| G | 27655 | 0.6% |
| N | 27655 | 0.6% |
| R | 2314 | < 0.1% |
| Other values (8) | 6523 | 0.1% |
taxonomicStatus
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.919303484 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | SYNONYM |
| 3rd row | ACCEPTED |
| 4th row | SYNONYM |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 552476 | |
| synonym | 48535 | 8.1% |
| doubtful | 440 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1104952 | |
| E | 1104952 | |
| T | 552916 | |
| D | 552916 | |
| A | 552476 | |
| P | 552476 | |
| Y | 97070 | 2.0% |
| N | 97070 | 2.0% |
| O | 48975 | 1.0% |
| S | 48535 | 1.0% |
| Other values (5) | 50735 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4763073 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1104952 | |
| E | 1104952 | |
| T | 552916 | |
| D | 552916 | |
| A | 552476 | |
| P | 552476 | |
| Y | 97070 | 2.0% |
| N | 97070 | 2.0% |
| O | 48975 | 1.0% |
| S | 48535 | 1.0% |
| Other values (5) | 50735 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4763073 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 1104952 | |
| E | 1104952 | |
| T | 552916 | |
| D | 552916 | |
| A | 552476 | |
| P | 552476 | |
| Y | 97070 | 2.0% |
| N | 97070 | 2.0% |
| O | 48975 | 1.0% |
| S | 48535 | 1.0% |
| Other values (5) | 50735 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4763073 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 1104952 | |
| E | 1104952 | |
| T | 552916 | |
| D | 552916 | |
| A | 552476 | |
| P | 552476 | |
| Y | 97070 | 2.0% |
| N | 97070 | 2.0% |
| O | 48975 | 1.0% |
| S | 48535 | 1.0% |
| Other values (5) | 50735 | 1.1% |
datasetKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|---|
| 2nd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 3rd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 4th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 5th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 2405804 | |
| a | 2405804 | |
| - | 2405804 | |
| 2 | 1804353 | |
| b | 1804353 | |
| 4 | 1804353 | |
| 8 | 1202902 | 5.6% |
| 3 | 1202902 | 5.6% |
| 5 | 1202902 | 5.6% |
| 9 | 1202902 | 5.6% |
| Other values (6) | 4210157 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10826118 | |
| Lowercase Letter | 8420314 | |
| Dash Punctuation | 2405804 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1804353 | |
| 4 | 1804353 | |
| 8 | 1202902 | |
| 3 | 1202902 | |
| 5 | 1202902 | |
| 9 | 1202902 | |
| 1 | 601451 | 5.6% |
| 7 | 601451 | 5.6% |
| 0 | 601451 | 5.6% |
| 6 | 601451 | 5.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 2405804 | |
| a | 2405804 | |
| b | 1804353 | |
| d | 1202902 | |
| e | 601451 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2405804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13231922 | |
| Latin | 8420314 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 2405804 | |
| 2 | 1804353 | |
| 4 | 1804353 | |
| 8 | 1202902 | |
| 3 | 1202902 | |
| 5 | 1202902 | |
| 9 | 1202902 | |
| 1 | 601451 | 4.5% |
| 7 | 601451 | 4.5% |
| 0 | 601451 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| c | 2405804 | |
| a | 2405804 | |
| b | 1804353 | |
| d | 1202902 | |
| e | 601451 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21652236 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 2405804 | |
| a | 2405804 | |
| - | 2405804 | |
| 2 | 1804353 | |
| b | 1804353 | |
| 4 | 1804353 | |
| 8 | 1202902 | 5.6% |
| 3 | 1202902 | 5.6% |
| 5 | 1202902 | 5.6% |
| 9 | 1202902 | 5.6% |
| Other values (6) | 4210157 |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 601451 | |
| S | 601451 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1202902 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 601451 | |
| S | 601451 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1202902 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 601451 | |
| S | 601451 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1202902 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 601451 | |
| S | 601451 |
lastInterpreted
Text
| Distinct | 185984 |
|---|---|
| Distinct (%) | 30.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99573698 |
| Min length | 20 |
Unique
| Unique | 38937 ? |
|---|---|
| Unique (%) | 6.5% |
Sample
| 1st row | 2024-12-02T13:58:01.255Z |
|---|---|
| 2nd row | 2024-12-02T13:59:38.442Z |
| 3rd row | 2024-12-02T13:56:07.605Z |
| 4th row | 2024-12-02T13:58:24.850Z |
| 5th row | 2024-12-02T13:56:12.476Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:14.377z | 17 | < 0.1% |
| 2024-12-02t13:57:24.313z | 17 | < 0.1% |
| 2024-12-02t13:57:59.063z | 17 | < 0.1% |
| 2024-12-02t13:57:52.813z | 17 | < 0.1% |
| 2024-12-02t13:57:15.231z | 17 | < 0.1% |
| 2024-12-02t13:57:50.062z | 16 | < 0.1% |
| 2024-12-02t13:57:52.024z | 16 | < 0.1% |
| 2024-12-02t13:57:25.776z | 16 | < 0.1% |
| 2024-12-02t13:56:59.760z | 15 | < 0.1% |
| 2024-12-02t13:57:24.391z | 15 | < 0.1% |
| Other values (185974) | 601288 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2746380 | |
| 0 | 1525337 | |
| 1 | 1517832 | |
| - | 1202902 | |
| : | 1202902 | |
| 4 | 967155 | 6.7% |
| 5 | 955236 | 6.6% |
| 3 | 952306 | 6.6% |
| T | 601451 | 4.2% |
| Z | 601451 | 4.2% |
| Other values (5) | 2159308 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10222744 | |
| Other Punctuation | 1803712 | 12.5% |
| Dash Punctuation | 1202902 | 8.3% |
| Uppercase Letter | 1202902 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2746380 | |
| 0 | 1525337 | |
| 1 | 1517832 | |
| 4 | 967155 | 9.5% |
| 5 | 955236 | 9.3% |
| 3 | 952306 | 9.3% |
| 7 | 460995 | 4.5% |
| 9 | 384640 | 3.8% |
| 6 | 362872 | 3.5% |
| 8 | 349991 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1202902 | |
| . | 600810 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 601451 | |
| Z | 601451 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1202902 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13229358 | |
| Latin | 1202902 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2746380 | |
| 0 | 1525337 | |
| 1 | 1517832 | |
| - | 1202902 | |
| : | 1202902 | |
| 4 | 967155 | 7.3% |
| 5 | 955236 | 7.2% |
| 3 | 952306 | 7.2% |
| . | 600810 | 4.5% |
| 7 | 460995 | 3.5% |
| Other values (3) | 1097503 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| T | 601451 | |
| Z | 601451 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14432260 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2746380 | |
| 0 | 1525337 | |
| 1 | 1517832 | |
| - | 1202902 | |
| : | 1202902 | |
| 4 | 967155 | 6.7% |
| 5 | 955236 | 6.6% |
| 3 | 952306 | 6.6% |
| T | 601451 | 4.2% |
| Z | 601451 | 4.2% |
| Other values (5) | 2159308 |
elevation
Text
Missing 
| Distinct | 1569 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 496901 |
| Missing (%) | 82.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.310425634 |
| Min length | 3 |
Unique
| Unique | 448 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 1032.0 |
|---|---|
| 2nd row | 1006.0 |
| 3rd row | 545.0 |
| 4th row | 2134.0 |
| 5th row | 130.0 |
| Value | Count | Frequency (%) |
| 155.0 | 2555 | 2.4% |
| 150.0 | 2080 | 2.0% |
| 975.0 | 1931 | 1.8% |
| 1829.0 | 1920 | 1.8% |
| 1219.0 | 1756 | 1.7% |
| 1524.0 | 1715 | 1.6% |
| 2438.0 | 1448 | 1.4% |
| 2134.0 | 1349 | 1.3% |
| 914.0 | 1245 | 1.2% |
| 610.0 | 1175 | 1.1% |
| Other values (1556) | 87376 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 157702 | |
| . | 104550 | |
| 1 | 64998 | |
| 2 | 42656 | 7.7% |
| 5 | 41946 | 7.6% |
| 3 | 29052 | 5.2% |
| 4 | 25982 | 4.7% |
| 7 | 23929 | 4.3% |
| 9 | 22522 | 4.1% |
| 6 | 21007 | 3.8% |
| Other values (2) | 20861 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 450648 | |
| Other Punctuation | 104550 | 18.8% |
| Dash Punctuation | 7 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 157702 | |
| 1 | 64998 | |
| 2 | 42656 | 9.5% |
| 5 | 41946 | 9.3% |
| 3 | 29052 | 6.4% |
| 4 | 25982 | 5.8% |
| 7 | 23929 | 5.3% |
| 9 | 22522 | 5.0% |
| 6 | 21007 | 4.7% |
| 8 | 20854 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 104550 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 555205 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 157702 | |
| . | 104550 | |
| 1 | 64998 | |
| 2 | 42656 | 7.7% |
| 5 | 41946 | 7.6% |
| 3 | 29052 | 5.2% |
| 4 | 25982 | 4.7% |
| 7 | 23929 | 4.3% |
| 9 | 22522 | 4.1% |
| 6 | 21007 | 3.8% |
| Other values (2) | 20861 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 555205 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 157702 | |
| . | 104550 | |
| 1 | 64998 | |
| 2 | 42656 | 7.7% |
| 5 | 41946 | 7.6% |
| 3 | 29052 | 5.2% |
| 4 | 25982 | 4.7% |
| 7 | 23929 | 4.3% |
| 9 | 22522 | 4.1% |
| 6 | 21007 | 3.8% |
| Other values (2) | 20861 | 3.8% |
Missing 
| Distinct | 72 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 597572 |
| Missing (%) | 99.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.337200309 |
| Min length | 3 |
Unique
| Unique | 14 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 15.5 |
|---|---|
| 2nd row | 46.0 |
| 3rd row | 30.5 |
| 4th row | 250.0 |
| 5th row | 150.0 |
| Value | Count | Frequency (%) |
| 38.0 | 907 | |
| 150.0 | 345 | 8.9% |
| 250.0 | 244 | 6.3% |
| 304.5 | 236 | 6.1% |
| 120.0 | 156 | 4.0% |
| 46.0 | 152 | 3.9% |
| 76.5 | 149 | 3.8% |
| 100.0 | 145 | 3.7% |
| 15.0 | 122 | 3.1% |
| 37.5 | 108 | 2.8% |
| Other values (62) | 1315 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4442 | |
| . | 3879 | |
| 5 | 2371 | |
| 3 | 1535 | 9.1% |
| 1 | 1270 | 7.5% |
| 8 | 1062 | 6.3% |
| 2 | 857 | 5.1% |
| 4 | 589 | 3.5% |
| 7 | 404 | 2.4% |
| 6 | 395 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12945 | |
| Other Punctuation | 3879 | 23.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4442 | |
| 5 | 2371 | |
| 3 | 1535 | 11.9% |
| 1 | 1270 | 9.8% |
| 8 | 1062 | 8.2% |
| 2 | 857 | 6.6% |
| 4 | 589 | 4.6% |
| 7 | 404 | 3.1% |
| 6 | 395 | 3.1% |
| 9 | 20 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3879 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16824 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4442 | |
| . | 3879 | |
| 5 | 2371 | |
| 3 | 1535 | 9.1% |
| 1 | 1270 | 7.5% |
| 8 | 1062 | 6.3% |
| 2 | 857 | 5.1% |
| 4 | 589 | 3.5% |
| 7 | 404 | 2.4% |
| 6 | 395 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16824 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4442 | |
| . | 3879 | |
| 5 | 2371 | |
| 3 | 1535 | 9.1% |
| 1 | 1270 | 7.5% |
| 8 | 1062 | 6.3% |
| 2 | 857 | 5.1% |
| 4 | 589 | 3.5% |
| 7 | 404 | 2.4% |
| 6 | 395 | 2.3% |
depth
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 601448 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.666666667 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | 853.0 |
|---|---|
| 2nd row | 1600.0 |
| 3rd row | 1600.0 |
| Value | Count | Frequency (%) |
| 1600.0 | 2 | |
| 853.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7 | |
| . | 3 | |
| 1 | 2 | 11.8% |
| 6 | 2 | 11.8% |
| 8 | 1 | 5.9% |
| 5 | 1 | 5.9% |
| 3 | 1 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 | |
| Other Punctuation | 3 | 17.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7 | |
| 1 | 2 | 14.3% |
| 6 | 2 | 14.3% |
| 8 | 1 | 7.1% |
| 5 | 1 | 7.1% |
| 3 | 1 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7 | |
| . | 3 | |
| 1 | 2 | 11.8% |
| 6 | 2 | 11.8% |
| 8 | 1 | 5.9% |
| 5 | 1 | 5.9% |
| 3 | 1 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7 | |
| . | 3 | |
| 1 | 2 | 11.8% |
| 6 | 2 | 11.8% |
| 8 | 1 | 5.9% |
| 5 | 1 | 5.9% |
| 3 | 1 | 5.9% |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 34 |
|---|---|
| Distinct (%) | 12.5% |
| Missing | 601180 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 16.32472325 |
| Min length | 3 |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | 7.4% |
Sample
| 1st row | 4411.160071289899 |
|---|---|
| 2nd row | 4411.160071289899 |
| 3rd row | 0.0 |
| 4th row | 4411.160071289899 |
| 5th row | 1895.2753464364682 |
| Value | Count | Frequency (%) |
| 4411.160071289899 | 100 | |
| 918.1358064728217 | 59 | |
| 818.1211019658687 | 23 | 8.5% |
| 0.0 | 16 | 5.9% |
| 1698.8813565505823 | 14 | 5.2% |
| 1895.2753464364682 | 9 | 3.3% |
| 2501.879815645856 | 7 | 2.6% |
| 862.8264353705852 | 5 | 1.8% |
| 1136.4802457515602 | 5 | 1.8% |
| 3374.3891962124544 | 4 | 1.5% |
| Other values (24) | 29 | 10.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 818 | |
| 8 | 634 | |
| 9 | 458 | |
| 4 | 388 | |
| 0 | 384 | |
| 2 | 361 | |
| 6 | 349 | |
| 7 | 316 | 7.1% |
| . | 271 | 6.1% |
| 5 | 263 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4153 | |
| Other Punctuation | 271 | 6.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 818 | |
| 8 | 634 | |
| 9 | 458 | |
| 4 | 388 | |
| 0 | 384 | |
| 2 | 361 | |
| 6 | 349 | |
| 7 | 316 | 7.6% |
| 5 | 263 | 6.3% |
| 3 | 182 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 271 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4424 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 818 | |
| 8 | 634 | |
| 9 | 458 | |
| 4 | 388 | |
| 0 | 384 | |
| 2 | 361 | |
| 6 | 349 | |
| 7 | 316 | 7.1% |
| . | 271 | 6.1% |
| 5 | 263 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4424 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 818 | |
| 8 | 634 | |
| 9 | 458 | |
| 4 | 388 | |
| 0 | 384 | |
| 2 | 361 | |
| 6 | 349 | |
| 7 | 316 | 7.1% |
| . | 271 | 6.1% |
| 5 | 263 | 5.9% |
issue
Text
| Distinct | 117 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 13 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 187 |
|---|---|
| Median length | 48 |
| Mean length | 62.38084724 |
| Min length | 48 |
Unique
| Unique | 21 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 3rd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;TAXON_MATCH_FUZZY |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 5th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES;CONTINENT_INVALID |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count | 356968 | |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84 | 110523 | 18.4% |
| occurrence_status_inferred_from_individual_count;taxon_match_higherrank | 68486 | 11.4% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid | 19137 | 3.2% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;continent_invalid | 9426 | 1.6% |
| occurrence_status_inferred_from_individual_count;taxon_match_fuzzy | 7808 | 1.3% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;geodetic_datum_invalid | 6304 | 1.0% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;taxon_match_higherrank | 3860 | 0.6% |
| occurrence_status_inferred_from_individual_count;country_derived_from_coordinates;geodetic_datum_assumed_wgs84;continent_invalid | 3671 | 0.6% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_invalid | 3605 | 0.6% |
| Other values (107) | 11650 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 3814538 | |
| R | 3273760 | 8.7% |
| E | 3131386 | 8.3% |
| I | 2892193 | 7.7% |
| N | 2889124 | 7.7% |
| C | 2776336 | 7.4% |
| U | 2757004 | 7.3% |
| T | 2507326 | 6.7% |
| D | 2429693 | 6.5% |
| O | 2238112 | 6.0% |
| Other values (18) | 8808740 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 33066744 | |
| Connector Punctuation | 3814538 | 10.2% |
| Other Punctuation | 329862 | 0.9% |
| Decimal Number | 307068 | 0.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 3273760 | |
| E | 3131386 | |
| I | 2892193 | |
| N | 2889124 | |
| C | 2776336 | |
| U | 2757004 | |
| T | 2507326 | 7.6% |
| D | 2429693 | 7.3% |
| O | 2238112 | 6.8% |
| A | 1842556 | 5.6% |
| Other values (14) | 6329254 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 153534 | |
| 4 | 153534 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3814538 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 329862 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33066744 | |
| Common | 4451468 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 3273760 | |
| E | 3131386 | |
| I | 2892193 | |
| N | 2889124 | |
| C | 2776336 | |
| U | 2757004 | |
| T | 2507326 | 7.6% |
| D | 2429693 | 7.3% |
| O | 2238112 | 6.8% |
| A | 1842556 | 5.6% |
| Other values (14) | 6329254 |
Common
| Value | Count | Frequency (%) |
| _ | 3814538 | |
| ; | 329862 | 7.4% |
| 8 | 153534 | 3.4% |
| 4 | 153534 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37518212 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 3814538 | |
| R | 3273760 | 8.7% |
| E | 3131386 | 8.3% |
| I | 2892193 | 7.7% |
| N | 2889124 | 7.7% |
| C | 2776336 | 7.4% |
| U | 2757004 | 7.3% |
| T | 2507326 | 6.7% |
| D | 2429693 | 6.5% |
| O | 2238112 | 6.0% |
| Other values (18) | 8808740 |
mediaType
Text
Missing 
| Distinct | 55 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 45831 |
| Missing (%) | 7.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 1385 |
|---|---|
| Median length | 10 |
| Mean length | 11.66078975 |
| Min length | 10 |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| 3rd row | StillImage |
| 4th row | StillImage |
| 5th row | StillImage |
| Value | Count | Frequency (%) |
| stillimage | 509092 | |
| stillimage;stillimage | 38794 | 7.0% |
| stillimage;stillimage;stillimage | 2854 | 0.5% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 1339 | 0.2% |
| stillimage;stillimage;stillimage;stillimage | 1231 | 0.2% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 614 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 321 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 256 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 250 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 170 | < 0.1% |
| Other values (45) | 699 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1279016 | |
| S | 639508 | |
| t | 639508 | |
| i | 639508 | |
| I | 639508 | |
| m | 639508 | |
| a | 639508 | |
| g | 639508 | |
| e | 639508 | |
| ; | 83888 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5116064 | |
| Uppercase Letter | 1279016 | 19.7% |
| Other Punctuation | 83888 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1279016 | |
| t | 639508 | |
| i | 639508 | |
| m | 639508 | |
| a | 639508 | |
| g | 639508 | |
| e | 639508 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 639508 | |
| I | 639508 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 83888 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6395080 | |
| Common | 83888 | 1.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 1279016 | |
| S | 639508 | |
| t | 639508 | |
| i | 639508 | |
| I | 639508 | |
| m | 639508 | |
| a | 639508 | |
| g | 639508 | |
| e | 639508 |
Common
| Value | Count | Frequency (%) |
| ; | 83888 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6478968 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 1279016 | |
| S | 639508 | |
| t | 639508 | |
| i | 639508 | |
| I | 639508 | |
| m | 639508 | |
| a | 639508 | |
| g | 639508 | |
| e | 639508 | |
| ; | 83888 | 1.3% |
hasCoordinate
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.744727334 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | true |
| 4th row | false |
| 5th row | true |
| Value | Count | Frequency (%) |
| false | 447917 | |
| true | 153534 | 25.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 601451 | |
| f | 447917 | |
| a | 447917 | |
| l | 447917 | |
| s | 447917 | |
| t | 153534 | 5.4% |
| r | 153534 | 5.4% |
| u | 153534 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2853721 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 601451 | |
| f | 447917 | |
| a | 447917 | |
| l | 447917 | |
| s | 447917 | |
| t | 153534 | 5.4% |
| r | 153534 | 5.4% |
| u | 153534 | 5.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2853721 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 601451 | |
| f | 447917 | |
| a | 447917 | |
| l | 447917 | |
| s | 447917 | |
| t | 153534 | 5.4% |
| r | 153534 | 5.4% |
| u | 153534 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2853721 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 601451 | |
| f | 447917 | |
| a | 447917 | |
| l | 447917 | |
| s | 447917 | |
| t | 153534 | 5.4% |
| r | 153534 | 5.4% |
| u | 153534 | 5.4% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.996536709 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 599368 | |
| true | 2083 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 601451 | |
| f | 599368 | |
| a | 599368 | |
| l | 599368 | |
| s | 599368 | |
| t | 2083 | 0.1% |
| r | 2083 | 0.1% |
| u | 2083 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3005172 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 601451 | |
| f | 599368 | |
| a | 599368 | |
| l | 599368 | |
| s | 599368 | |
| t | 2083 | 0.1% |
| r | 2083 | 0.1% |
| u | 2083 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3005172 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 601451 | |
| f | 599368 | |
| a | 599368 | |
| l | 599368 | |
| s | 599368 | |
| t | 2083 | 0.1% |
| r | 2083 | 0.1% |
| u | 2083 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3005172 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 601451 | |
| f | 599368 | |
| a | 599368 | |
| l | 599368 | |
| s | 599368 | |
| t | 2083 | 0.1% |
| r | 2083 | 0.1% |
| u | 2083 | 0.1% |
taxonKey
Text
| Distinct | 7326 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.991821445 |
| Min length | 2 |
Unique
| Unique | 849 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2433573 |
|---|---|
| 2nd row | 6163544 |
| 3rd row | 2433177 |
| 4th row | 9119004 |
| 5th row | 2440447 |
| Value | Count | Frequency (%) |
| 2437967 | 13357 | 2.2% |
| 2440447 | 11847 | 2.0% |
| 2438904 | 8874 | 1.5% |
| 2433176 | 8329 | 1.4% |
| 2438019 | 7116 | 1.2% |
| 2433272 | 5470 | 0.9% |
| 2439270 | 5412 | 0.9% |
| 2437782 | 5206 | 0.9% |
| 4264939 | 4687 | 0.8% |
| 5706760 | 4437 | 0.7% |
| Other values (7316) | 526716 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 686795 | |
| 4 | 677503 | |
| 3 | 521933 | |
| 6 | 471038 | |
| 7 | 393586 | |
| 1 | 320627 | |
| 8 | 310437 | |
| 9 | 301609 | |
| 5 | 266002 | 6.3% |
| 0 | 255708 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4205238 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 686795 | |
| 4 | 677503 | |
| 3 | 521933 | |
| 6 | 471038 | |
| 7 | 393586 | |
| 1 | 320627 | |
| 8 | 310437 | |
| 9 | 301609 | |
| 5 | 266002 | 6.3% |
| 0 | 255708 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4205238 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 686795 | |
| 4 | 677503 | |
| 3 | 521933 | |
| 6 | 471038 | |
| 7 | 393586 | |
| 1 | 320627 | |
| 8 | 310437 | |
| 9 | 301609 | |
| 5 | 266002 | 6.3% |
| 0 | 255708 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4205238 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 686795 | |
| 4 | 677503 | |
| 3 | 521933 | |
| 6 | 471038 | |
| 7 | 393586 | |
| 1 | 320627 | |
| 8 | 310437 | |
| 9 | 301609 | |
| 5 | 266002 | 6.3% |
| 0 | 255708 | 6.1% |
acceptedTaxonKey
Text
| Distinct | 6815 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.999645856 |
| Min length | 2 |
Unique
| Unique | 793 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2433573 |
|---|---|
| 2nd row | 2438621 |
| 3rd row | 2433177 |
| 4th row | 2438034 |
| 5th row | 2440447 |
| Value | Count | Frequency (%) |
| 2437967 | 14724 | 2.4% |
| 2440447 | 11867 | 2.0% |
| 2438904 | 8874 | 1.5% |
| 2433176 | 8329 | 1.4% |
| 2438019 | 7347 | 1.2% |
| 2438655 | 6840 | 1.1% |
| 2433272 | 5470 | 0.9% |
| 2439270 | 5412 | 0.9% |
| 2437782 | 5206 | 0.9% |
| 4264939 | 4687 | 0.8% |
| Other values (6805) | 522695 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 695586 | |
| 4 | 682390 | |
| 3 | 540567 | |
| 6 | 455861 | |
| 7 | 389135 | |
| 1 | 323452 | |
| 8 | 317316 | |
| 9 | 293353 | |
| 5 | 267471 | 6.4% |
| 0 | 244813 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4209944 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 695586 | |
| 4 | 682390 | |
| 3 | 540567 | |
| 6 | 455861 | |
| 7 | 389135 | |
| 1 | 323452 | |
| 8 | 317316 | |
| 9 | 293353 | |
| 5 | 267471 | 6.4% |
| 0 | 244813 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4209944 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 695586 | |
| 4 | 682390 | |
| 3 | 540567 | |
| 6 | 455861 | |
| 7 | 389135 | |
| 1 | 323452 | |
| 8 | 317316 | |
| 9 | 293353 | |
| 5 | 267471 | 6.4% |
| 0 | 244813 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4209944 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 695586 | |
| 4 | 682390 | |
| 3 | 540567 | |
| 6 | 455861 | |
| 7 | 389135 | |
| 1 | 323452 | |
| 8 | 317316 | |
| 9 | 293353 | |
| 5 | 267471 | 6.4% |
| 0 | 244813 | 5.8% |
kingdomKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 601451 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 601451 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 601451 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 601451 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 601451 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 601451 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 601451 |
phylumKey
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 44 |
|---|---|
| 2nd row | 44 |
| 3rd row | 44 |
| 4th row | 44 |
| 5th row | 44 |
| Value | Count | Frequency (%) |
| 44 | 601449 | |
| 52 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 1202898 | |
| 5 | 2 | < 0.1% |
| 2 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1202902 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1202898 | |
| 5 | 2 | < 0.1% |
| 2 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1202902 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 1202898 | |
| 5 | 2 | < 0.1% |
| 2 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1202902 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 1202898 | |
| 5 | 2 | < 0.1% |
| 2 | 2 | < 0.1% |
classKey
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 359 |
|---|---|
| 2nd row | 359 |
| 3rd row | 359 |
| 4th row | 359 |
| 5th row | 359 |
| Value | Count | Frequency (%) |
| 359 | 601448 | |
| 225 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 601450 | |
| 3 | 601448 | |
| 9 | 601448 | |
| 2 | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1804350 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 601450 | |
| 3 | 601448 | |
| 9 | 601448 | |
| 2 | 4 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1804350 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 601450 | |
| 3 | 601448 | |
| 9 | 601448 | |
| 2 | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1804350 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 601450 | |
| 3 | 601448 | |
| 9 | 601448 | |
| 2 | 4 | < 0.1% |
orderKey
Text
| Distinct | 29 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.501132267 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 732 |
|---|---|
| 2nd row | 1459 |
| 3rd row | 734 |
| 4th row | 1459 |
| 5th row | 733 |
| Value | Count | Frequency (%) |
| 1459 | 297636 | |
| 734 | 129084 | |
| 733 | 47588 | 7.9% |
| 732 | 47294 | 7.9% |
| 803 | 30383 | 5.1% |
| 785 | 11977 | 2.0% |
| 731 | 11375 | 1.9% |
| 798 | 10781 | 1.8% |
| 783 | 5645 | 0.9% |
| 1452 | 1652 | 0.3% |
| Other values (19) | 8033 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 431834 | |
| 3 | 321250 | |
| 1 | 314421 | |
| 5 | 313551 | |
| 9 | 311268 | |
| 7 | 266248 | |
| 8 | 63351 | 3.0% |
| 2 | 51029 | 2.4% |
| 0 | 32324 | 1.5% |
| 6 | 473 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2105749 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 431834 | |
| 3 | 321250 | |
| 1 | 314421 | |
| 5 | 313551 | |
| 9 | 311268 | |
| 7 | 266248 | |
| 8 | 63351 | 3.0% |
| 2 | 51029 | 2.4% |
| 0 | 32324 | 1.5% |
| 6 | 473 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2105749 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 431834 | |
| 3 | 321250 | |
| 1 | 314421 | |
| 5 | 313551 | |
| 9 | 311268 | |
| 7 | 266248 | |
| 8 | 63351 | 3.0% |
| 2 | 51029 | 2.4% |
| 0 | 32324 | 1.5% |
| 6 | 473 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2105749 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 431834 | |
| 3 | 321250 | |
| 1 | 314421 | |
| 5 | 313551 | |
| 9 | 311268 | |
| 7 | 266248 | |
| 8 | 63351 | 3.0% |
| 2 | 51029 | 2.4% |
| 0 | 32324 | 1.5% |
| 6 | 473 | < 0.1% |
familyKey
Text
| Distinct | 158 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1158 |
| Missing (%) | 0.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.622670929 |
| Min length | 4 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 5311 |
|---|---|
| 2nd row | 3240723 |
| 3rd row | 9366 |
| 4th row | 3240723 |
| 5th row | 5314 |
| Value | Count | Frequency (%) |
| 3240723 | 107243 | |
| 5510 | 93911 | |
| 9366 | 55530 | 9.3% |
| 9456 | 46130 | 7.7% |
| 5534 | 27470 | 4.6% |
| 5314 | 23642 | 3.9% |
| 9368 | 22260 | 3.7% |
| 5504 | 19997 | 3.3% |
| 5719 | 13560 | 2.3% |
| 9701 | 12559 | 2.1% |
| Other values (148) | 177991 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 475202 | |
| 3 | 474928 | |
| 4 | 299816 | |
| 9 | 285815 | |
| 0 | 267438 | |
| 6 | 258430 | |
| 2 | 257026 | |
| 7 | 205986 | |
| 1 | 200321 | |
| 8 | 49995 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2774957 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 475202 | |
| 3 | 474928 | |
| 4 | 299816 | |
| 9 | 285815 | |
| 0 | 267438 | |
| 6 | 258430 | |
| 2 | 257026 | |
| 7 | 205986 | |
| 1 | 200321 | |
| 8 | 49995 | 1.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2774957 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 475202 | |
| 3 | 474928 | |
| 4 | 299816 | |
| 9 | 285815 | |
| 0 | 267438 | |
| 6 | 258430 | |
| 2 | 257026 | |
| 7 | 205986 | |
| 1 | 200321 | |
| 8 | 49995 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2774957 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 475202 | |
| 3 | 474928 | |
| 4 | 299816 | |
| 9 | 285815 | |
| 0 | 267438 | |
| 6 | 258430 | |
| 2 | 257026 | |
| 7 | 205986 | |
| 1 | 200321 | |
| 8 | 49995 | 1.8% |
genusKey
Text
| Distinct | 1129 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1999 |
| Missing (%) | 0.3% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.000950868 |
| Min length | 7 |
Unique
| Unique | 62 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2433572 |
|---|---|
| 2nd row | 2438591 |
| 3rd row | 2433174 |
| 4th row | 2437961 |
| 5th row | 2440446 |
| Value | Count | Frequency (%) |
| 2437961 | 38753 | 6.5% |
| 2438591 | 19877 | 3.3% |
| 2439223 | 16463 | 2.7% |
| 2435935 | 15826 | 2.6% |
| 2433258 | 12467 | 2.1% |
| 2433174 | 12281 | 2.0% |
| 2440446 | 11894 | 2.0% |
| 2437422 | 11871 | 2.0% |
| 2438904 | 11447 | 1.9% |
| 9800657 | 10554 | 1.8% |
| Other values (1119) | 438019 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 839496 | |
| 4 | 830509 | |
| 3 | 764513 | |
| 9 | 329906 | 7.9% |
| 8 | 279872 | 6.7% |
| 7 | 255740 | 6.1% |
| 5 | 254775 | 6.1% |
| 6 | 229246 | 5.5% |
| 1 | 221313 | 5.3% |
| 0 | 191364 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4196734 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 839496 | |
| 4 | 830509 | |
| 3 | 764513 | |
| 9 | 329906 | 7.9% |
| 8 | 279872 | 6.7% |
| 7 | 255740 | 6.1% |
| 5 | 254775 | 6.1% |
| 6 | 229246 | 5.5% |
| 1 | 221313 | 5.3% |
| 0 | 191364 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4196734 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 839496 | |
| 4 | 830509 | |
| 3 | 764513 | |
| 9 | 329906 | 7.9% |
| 8 | 279872 | 6.7% |
| 7 | 255740 | 6.1% |
| 5 | 254775 | 6.1% |
| 6 | 229246 | 5.5% |
| 1 | 221313 | 5.3% |
| 0 | 191364 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4196734 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 839496 | |
| 4 | 830509 | |
| 3 | 764513 | |
| 9 | 329906 | 7.9% |
| 8 | 279872 | 6.7% |
| 7 | 255740 | 6.1% |
| 5 | 254775 | 6.1% |
| 6 | 229246 | 5.5% |
| 1 | 221313 | 5.3% |
| 0 | 191364 | 4.6% |
speciesKey
Text
Missing 
| Distinct | 3897 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 29663 |
| Missing (%) | 4.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.006916899 |
| Min length | 7 |
Unique
| Unique | 406 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2433573 |
|---|---|
| 2nd row | 2438621 |
| 3rd row | 2433177 |
| 4th row | 2438034 |
| 5th row | 2440447 |
| Value | Count | Frequency (%) |
| 2437967 | 15647 | 2.7% |
| 2440447 | 11869 | 2.1% |
| 2433176 | 8329 | 1.5% |
| 2438019 | 7347 | 1.3% |
| 2438655 | 6840 | 1.2% |
| 7429082 | 6399 | 1.1% |
| 2433272 | 5482 | 1.0% |
| 2439270 | 5412 | 0.9% |
| 5219153 | 4558 | 0.8% |
| 5706760 | 4437 | 0.8% |
| Other values (3887) | 495468 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 731613 | |
| 4 | 679263 | |
| 3 | 607462 | |
| 7 | 337688 | |
| 9 | 302340 | |
| 8 | 300825 | |
| 6 | 290793 | 7.3% |
| 5 | 286808 | 7.2% |
| 1 | 253470 | 6.3% |
| 0 | 216209 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4006471 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 731613 | |
| 4 | 679263 | |
| 3 | 607462 | |
| 7 | 337688 | |
| 9 | 302340 | |
| 8 | 300825 | |
| 6 | 290793 | 7.3% |
| 5 | 286808 | 7.2% |
| 1 | 253470 | 6.3% |
| 0 | 216209 | 5.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4006471 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 731613 | |
| 4 | 679263 | |
| 3 | 607462 | |
| 7 | 337688 | |
| 9 | 302340 | |
| 8 | 300825 | |
| 6 | 290793 | 7.3% |
| 5 | 286808 | 7.2% |
| 1 | 253470 | 6.3% |
| 0 | 216209 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4006471 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 731613 | |
| 4 | 679263 | |
| 3 | 607462 | |
| 7 | 337688 | |
| 9 | 302340 | |
| 8 | 300825 | |
| 6 | 290793 | 7.3% |
| 5 | 286808 | 7.2% |
| 1 | 253470 | 6.3% |
| 0 | 216209 | 5.4% |
species
Text
Missing 
| Distinct | 3897 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 29663 |
| Missing (%) | 4.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 26 |
| Mean length | 18.14441541 |
| Min length | 8 |
Unique
| Unique | 406 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Potos flavus |
|---|---|
| 2nd row | Microtus longicaudus |
| 3rd row | Carollia brevicaudum |
| 4th row | Peromyscus mexicanus |
| 5th row | Tursiops truncatus |
| Value | Count | Frequency (%) |
| peromyscus | 38710 | 3.4% |
| rattus | 21793 | 1.9% |
| microtus | 19863 | 1.7% |
| sorex | 15805 | 1.4% |
| maniculatus | 15647 | 1.4% |
| artibeus | 12162 | 1.1% |
| tursiops | 11892 | 1.0% |
| truncatus | 11873 | 1.0% |
| tamias | 11870 | 1.0% |
| carollia | 11315 | 1.0% |
| Other values (3847) | 972646 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 1138591 | 11.0% |
| i | 850505 | 8.2% |
| a | 832841 | 8.0% |
| u | 789136 | 7.6% |
| o | 733517 | 7.1% |
| r | 656869 | 6.3% |
| e | 630821 | 6.1% |
| 571788 | 5.5% | |
| t | 508552 | 4.9% |
| l | 479658 | 4.6% |
| Other values (41) | 3182481 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9231183 | |
| Space Separator | 571788 | 5.5% |
| Uppercase Letter | 571788 | 5.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1138591 | |
| i | 850505 | 9.2% |
| a | 832841 | 9.0% |
| u | 789136 | 8.5% |
| o | 733517 | 7.9% |
| r | 656869 | 7.1% |
| e | 630821 | 6.8% |
| t | 508552 | 5.5% |
| l | 479658 | 5.2% |
| n | 466895 | 5.1% |
| Other values (16) | 2143798 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 93612 | |
| P | 83184 | |
| C | 55886 | |
| S | 54073 | |
| T | 51158 | |
| A | 32606 | 5.7% |
| R | 30807 | 5.4% |
| L | 22099 | 3.9% |
| D | 21617 | 3.8% |
| N | 20451 | 3.6% |
| Other values (14) | 106295 |
Space Separator
| Value | Count | Frequency (%) |
| 571788 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9802971 | |
| Common | 571788 | 5.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 1138591 | |
| i | 850505 | 8.7% |
| a | 832841 | 8.5% |
| u | 789136 | 8.0% |
| o | 733517 | 7.5% |
| r | 656869 | 6.7% |
| e | 630821 | 6.4% |
| t | 508552 | 5.2% |
| l | 479658 | 4.9% |
| n | 466895 | 4.8% |
| Other values (40) | 2715586 |
Common
| Value | Count | Frequency (%) |
| 571788 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10374759 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 1138591 | 11.0% |
| i | 850505 | 8.2% |
| a | 832841 | 8.0% |
| u | 789136 | 7.6% |
| o | 733517 | 7.1% |
| r | 656869 | 6.3% |
| e | 630821 | 6.1% |
| 571788 | 5.5% | |
| t | 508552 | 4.9% |
| l | 479658 | 4.6% |
| Other values (41) | 3182481 |
| Distinct | 6815 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 147 |
|---|---|
| Median length | 76 |
| Mean length | 34.65024416 |
| Min length | 7 |
Unique
| Unique | 793 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Potos flavus (Schreber, 1774) |
|---|---|
| 2nd row | Microtus longicaudus (Merriam, 1888) |
| 3rd row | Carollia brevicaudum (Schinz, 1821) |
| 4th row | Peromyscus mexicanus (Saussure, 1860) |
| 5th row | Tursiops truncatus (Montagu, 1821) |
| Value | Count | Frequency (%) |
| linnaeus | 54248 | 2.2% |
| 1758 | 48971 | 2.0% |
| thomas | 43832 | 1.8% |
| peromyscus | 38753 | 1.6% |
| merriam | 31855 | 1.3% |
| 24205 | 1.0% | |
| 1821 | 22026 | 0.9% |
| rattus | 21929 | 0.9% |
| wagner | 21848 | 0.9% |
| j.a.allen | 20548 | 0.8% |
| Other values (6260) | 2160706 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1887470 | 9.1% | |
| s | 1613579 | 7.7% |
| a | 1375849 | 6.6% |
| i | 1277042 | 6.1% |
| e | 1193882 | 5.7% |
| r | 1077033 | 5.2% |
| u | 1043189 | 5.0% |
| o | 1026603 | 4.9% |
| n | 888434 | 4.3% |
| l | 794841 | 3.8% |
| Other values (70) | 8662502 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14209920 | |
| Decimal Number | 2149776 | 10.3% |
| Space Separator | 1887470 | 9.1% |
| Uppercase Letter | 1266302 | 6.1% |
| Other Punctuation | 650408 | 3.1% |
| Open Punctuation | 334773 | 1.6% |
| Close Punctuation | 334773 | 1.6% |
| Dash Punctuation | 7002 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1613579 | |
| a | 1375849 | |
| i | 1277042 | 9.0% |
| e | 1193882 | 8.4% |
| r | 1077033 | 7.6% |
| u | 1043189 | 7.3% |
| o | 1026603 | 7.2% |
| n | 888434 | 6.3% |
| l | 794841 | 5.6% |
| t | 695386 | 4.9% |
| Other values (24) | 3224082 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 177257 | |
| P | 122846 | |
| T | 110795 | 8.7% |
| S | 106664 | 8.4% |
| A | 99376 | 7.8% |
| L | 97931 | 7.7% |
| G | 78980 | 6.2% |
| C | 74570 | 5.9% |
| B | 54633 | 4.3% |
| R | 50431 | 4.0% |
| Other values (18) | 292819 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 658265 | |
| 8 | 464785 | |
| 9 | 213267 | 9.9% |
| 7 | 166016 | 7.7% |
| 5 | 145419 | 6.8% |
| 0 | 128064 | 6.0% |
| 4 | 104493 | 4.9% |
| 3 | 97629 | 4.5% |
| 2 | 90691 | 4.2% |
| 6 | 81147 | 3.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 539214 | |
| . | 86211 | 13.3% |
| & | 24205 | 3.7% |
| ' | 778 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1887470 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 334773 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 334773 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7002 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15476222 | |
| Common | 5364202 | 25.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 1613579 | 10.4% |
| a | 1375849 | 8.9% |
| i | 1277042 | 8.3% |
| e | 1193882 | 7.7% |
| r | 1077033 | 7.0% |
| u | 1043189 | 6.7% |
| o | 1026603 | 6.6% |
| n | 888434 | 5.7% |
| l | 794841 | 5.1% |
| t | 695386 | 4.5% |
| Other values (52) | 4490384 |
Common
| Value | Count | Frequency (%) |
| 1887470 | ||
| 1 | 658265 | 12.3% |
| , | 539214 | 10.1% |
| 8 | 464785 | 8.7% |
| ( | 334773 | 6.2% |
| ) | 334773 | 6.2% |
| 9 | 213267 | 4.0% |
| 7 | 166016 | 3.1% |
| 5 | 145419 | 2.7% |
| 0 | 128064 | 2.4% |
| Other values (8) | 492156 | 9.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20827466 | |
| None | 12958 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1887470 | 9.1% | |
| s | 1613579 | 7.7% |
| a | 1375849 | 6.6% |
| i | 1277042 | 6.1% |
| e | 1193882 | 5.7% |
| r | 1077033 | 5.2% |
| u | 1043189 | 5.0% |
| o | 1026603 | 4.9% |
| n | 888434 | 4.3% |
| l | 794841 | 3.8% |
| Other values (60) | 8649544 |
None
| Value | Count | Frequency (%) |
| ü | 5162 | |
| É | 4278 | |
| é | 1649 | 12.7% |
| è | 1421 | 11.0% |
| ö | 310 | 2.4% |
| í | 70 | 0.5% |
| Ä | 24 | 0.2% |
| ä | 24 | 0.2% |
| á | 19 | 0.1% |
| ñ | 1 | < 0.1% |
| Distinct | 7805 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 43 |
| Mean length | 22.61255364 |
| Min length | 5 |
Unique
| Unique | 898 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Potos flavus |
|---|---|
| 2nd row | Microtus longicaudus longicaudus |
| 3rd row | Carollia brevicauda |
| 4th row | Peromyscus mexicanus totontepecus |
| 5th row | Tursiops truncatus |
| Value | Count | Frequency (%) |
| peromyscus | 38753 | 2.6% |
| sp | 28343 | 1.9% |
| rattus | 21929 | 1.5% |
| microtus | 19877 | 1.3% |
| maniculatus | 15880 | 1.1% |
| sorex | 15831 | 1.1% |
| artibeus | 12470 | 0.8% |
| carollia | 12281 | 0.8% |
| tursiops | 11895 | 0.8% |
| truncatus | 11875 | 0.8% |
| Other values (5505) | 1302266 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 1517215 | 11.2% |
| i | 1187099 | 8.7% |
| a | 1082276 | 8.0% |
| u | 980723 | 7.2% |
| o | 902387 | 6.6% |
| 889949 | 6.5% | |
| e | 862255 | 6.3% |
| r | 848292 | 6.2% |
| n | 665623 | 4.9% |
| l | 634731 | 4.7% |
| Other values (53) | 4029793 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12079597 | |
| Space Separator | 889949 | 6.5% |
| Uppercase Letter | 601771 | 4.4% |
| Other Punctuation | 28356 | 0.2% |
| Open Punctuation | 313 | < 0.1% |
| Close Punctuation | 313 | < 0.1% |
| Decimal Number | 44 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1517215 | |
| i | 1187099 | |
| a | 1082276 | 9.0% |
| u | 980723 | 8.1% |
| o | 902387 | 7.5% |
| e | 862255 | 7.1% |
| r | 848292 | 7.0% |
| n | 665623 | 5.5% |
| l | 634731 | 5.3% |
| t | 618435 | 5.1% |
| Other values (16) | 2780561 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 103156 | |
| P | 84557 | |
| C | 58907 | |
| S | 54594 | |
| T | 51645 | |
| A | 32571 | 5.4% |
| R | 31119 | 5.2% |
| G | 28180 | 4.7% |
| L | 23175 | 3.9% |
| N | 23069 | 3.8% |
| Other values (14) | 110798 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 13 | |
| 1 | 12 | |
| 2 | 7 | |
| 9 | 6 | |
| 5 | 3 | 6.8% |
| 0 | 2 | 4.5% |
| 4 | 1 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 28343 | |
| , | 11 | < 0.1% |
| / | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 889949 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 313 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 313 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12681368 | |
| Common | 918975 | 6.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 1517215 | |
| i | 1187099 | 9.4% |
| a | 1082276 | 8.5% |
| u | 980723 | 7.7% |
| o | 902387 | 7.1% |
| e | 862255 | 6.8% |
| r | 848292 | 6.7% |
| n | 665623 | 5.2% |
| l | 634731 | 5.0% |
| t | 618435 | 4.9% |
| Other values (40) | 3382332 |
Common
| Value | Count | Frequency (%) |
| 889949 | ||
| . | 28343 | 3.1% |
| ( | 313 | < 0.1% |
| ) | 313 | < 0.1% |
| 8 | 13 | < 0.1% |
| 1 | 12 | < 0.1% |
| , | 11 | < 0.1% |
| 2 | 7 | < 0.1% |
| 9 | 6 | < 0.1% |
| 5 | 3 | < 0.1% |
| Other values (3) | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13600343 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 1517215 | 11.2% |
| i | 1187099 | 8.7% |
| a | 1082276 | 8.0% |
| u | 980723 | 7.2% |
| o | 902387 | 6.6% |
| 889949 | 6.5% | |
| e | 862255 | 6.3% |
| r | 848292 | 6.2% |
| n | 665623 | 4.9% |
| l | 634731 | 4.7% |
| Other values (53) | 4029793 |
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 601451 | |
| M | 601451 | |
| L | 601451 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1804353 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 601451 | |
| M | 601451 | |
| L | 601451 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1804353 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 601451 | |
| M | 601451 | |
| L | 601451 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1804353 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 601451 | |
| M | 601451 | |
| L | 601451 |
lastParsed
Text
| Distinct | 185984 |
|---|---|
| Distinct (%) | 30.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99573698 |
| Min length | 20 |
Unique
| Unique | 38937 ? |
|---|---|
| Unique (%) | 6.5% |
Sample
| 1st row | 2024-12-02T13:58:01.255Z |
|---|---|
| 2nd row | 2024-12-02T13:59:38.442Z |
| 3rd row | 2024-12-02T13:56:07.605Z |
| 4th row | 2024-12-02T13:58:24.850Z |
| 5th row | 2024-12-02T13:56:12.476Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:14.377z | 17 | < 0.1% |
| 2024-12-02t13:57:24.313z | 17 | < 0.1% |
| 2024-12-02t13:57:59.063z | 17 | < 0.1% |
| 2024-12-02t13:57:52.813z | 17 | < 0.1% |
| 2024-12-02t13:57:15.231z | 17 | < 0.1% |
| 2024-12-02t13:57:50.062z | 16 | < 0.1% |
| 2024-12-02t13:57:52.024z | 16 | < 0.1% |
| 2024-12-02t13:57:25.776z | 16 | < 0.1% |
| 2024-12-02t13:56:59.760z | 15 | < 0.1% |
| 2024-12-02t13:57:24.391z | 15 | < 0.1% |
| Other values (185974) | 601288 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2746380 | |
| 0 | 1525337 | |
| 1 | 1517832 | |
| - | 1202902 | |
| : | 1202902 | |
| 4 | 967155 | 6.7% |
| 5 | 955236 | 6.6% |
| 3 | 952306 | 6.6% |
| T | 601451 | 4.2% |
| Z | 601451 | 4.2% |
| Other values (5) | 2159308 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10222744 | |
| Other Punctuation | 1803712 | 12.5% |
| Dash Punctuation | 1202902 | 8.3% |
| Uppercase Letter | 1202902 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2746380 | |
| 0 | 1525337 | |
| 1 | 1517832 | |
| 4 | 967155 | 9.5% |
| 5 | 955236 | 9.3% |
| 3 | 952306 | 9.3% |
| 7 | 460995 | 4.5% |
| 9 | 384640 | 3.8% |
| 6 | 362872 | 3.5% |
| 8 | 349991 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1202902 | |
| . | 600810 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 601451 | |
| Z | 601451 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1202902 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13229358 | |
| Latin | 1202902 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2746380 | |
| 0 | 1525337 | |
| 1 | 1517832 | |
| - | 1202902 | |
| : | 1202902 | |
| 4 | 967155 | 7.3% |
| 5 | 955236 | 7.2% |
| 3 | 952306 | 7.2% |
| . | 600810 | 4.5% |
| 7 | 460995 | 3.5% |
| Other values (3) | 1097503 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| T | 601451 | |
| Z | 601451 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14432260 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2746380 | |
| 0 | 1525337 | |
| 1 | 1517832 | |
| - | 1202902 | |
| : | 1202902 | |
| 4 | 967155 | 6.7% |
| 5 | 955236 | 6.6% |
| 3 | 952306 | 6.6% |
| T | 601451 | 4.2% |
| Z | 601451 | 4.2% |
| Other values (5) | 2159308 |
lastCrawled
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2024-12-02T11:48:23.416Z |
|---|---|
| 2nd row | 2024-12-02T11:48:23.416Z |
| 3rd row | 2024-12-02T11:48:23.416Z |
| 4th row | 2024-12-02T11:48:23.416Z |
| 5th row | 2024-12-02T11:48:23.416Z |
| Value | Count | Frequency (%) |
| 2024-12-02t11:48:23.416z | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3007255 | |
| 1 | 2405804 | |
| 4 | 1804353 | |
| 0 | 1202902 | 8.3% |
| - | 1202902 | 8.3% |
| : | 1202902 | 8.3% |
| T | 601451 | 4.2% |
| 8 | 601451 | 4.2% |
| 3 | 601451 | 4.2% |
| . | 601451 | 4.2% |
| Other values (2) | 1202902 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10224667 | |
| Other Punctuation | 1804353 | 12.5% |
| Dash Punctuation | 1202902 | 8.3% |
| Uppercase Letter | 1202902 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3007255 | |
| 1 | 2405804 | |
| 4 | 1804353 | |
| 0 | 1202902 | 11.8% |
| 8 | 601451 | 5.9% |
| 3 | 601451 | 5.9% |
| 6 | 601451 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1202902 | |
| . | 601451 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 601451 | |
| Z | 601451 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1202902 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13231922 | |
| Latin | 1202902 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3007255 | |
| 1 | 2405804 | |
| 4 | 1804353 | |
| 0 | 1202902 | 9.1% |
| - | 1202902 | 9.1% |
| : | 1202902 | 9.1% |
| 8 | 601451 | 4.5% |
| 3 | 601451 | 4.5% |
| . | 601451 | 4.5% |
| 6 | 601451 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 601451 | |
| Z | 601451 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14434824 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3007255 | |
| 1 | 2405804 | |
| 4 | 1804353 | |
| 0 | 1202902 | 8.3% |
| - | 1202902 | 8.3% |
| : | 1202902 | 8.3% |
| T | 601451 | 4.2% |
| 8 | 601451 | 4.2% |
| 3 | 601451 | 4.2% |
| . | 601451 | 4.2% |
| Other values (2) | 1202902 | 8.3% |
repatriated
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2505 |
| Missing (%) | 0.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.377813693 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | true |
|---|---|
| 2nd row | false |
| 3rd row | true |
| 4th row | true |
| 5th row | false |
| Value | Count | Frequency (%) |
| true | 372656 | |
| false | 226290 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 598946 | |
| t | 372656 | |
| r | 372656 | |
| u | 372656 | |
| f | 226290 | 8.6% |
| a | 226290 | 8.6% |
| l | 226290 | 8.6% |
| s | 226290 | 8.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2622074 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 598946 | |
| t | 372656 | |
| r | 372656 | |
| u | 372656 | |
| f | 226290 | 8.6% |
| a | 226290 | 8.6% |
| l | 226290 | 8.6% |
| s | 226290 | 8.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2622074 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 598946 | |
| t | 372656 | |
| r | 372656 | |
| u | 372656 | |
| f | 226290 | 8.6% |
| a | 226290 | 8.6% |
| l | 226290 | 8.6% |
| s | 226290 | 8.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2622074 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 598946 | |
| t | 372656 | |
| r | 372656 | |
| u | 372656 | |
| f | 226290 | 8.6% |
| a | 226290 | 8.6% |
| l | 226290 | 8.6% |
| s | 226290 | 8.6% |
isSequenced
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.998247571 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 600397 | |
| true | 1054 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 601451 | |
| f | 600397 | |
| a | 600397 | |
| l | 600397 | |
| s | 600397 | |
| t | 1054 | < 0.1% |
| r | 1054 | < 0.1% |
| u | 1054 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3006201 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 601451 | |
| f | 600397 | |
| a | 600397 | |
| l | 600397 | |
| s | 600397 | |
| t | 1054 | < 0.1% |
| r | 1054 | < 0.1% |
| u | 1054 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3006201 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 601451 | |
| f | 600397 | |
| a | 600397 | |
| l | 600397 | |
| s | 600397 | |
| t | 1054 | < 0.1% |
| r | 1054 | < 0.1% |
| u | 1054 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3006201 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 601451 | |
| f | 600397 | |
| a | 600397 | |
| l | 600397 | |
| s | 600397 | |
| t | 1054 | < 0.1% |
| r | 1054 | < 0.1% |
| u | 1054 | < 0.1% |
gbifRegion
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 15955 |
| Missing (%) | 2.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 10.49816395 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LATIN_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | LATIN_AMERICA |
| 4th row | LATIN_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 245840 | |
| latin_america | 145714 | |
| africa | 101325 | |
| asia | 63583 | 10.9% |
| europe | 17807 | 3.0% |
| oceania | 8321 | 1.4% |
| antarctica | 2906 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1283998 | |
| R | 759432 | |
| I | 713403 | |
| C | 507012 | 8.2% |
| E | 435489 | 7.1% |
| N | 402781 | 6.6% |
| T | 397366 | 6.5% |
| _ | 391554 | 6.4% |
| M | 391554 | 6.4% |
| O | 271968 | 4.4% |
| Other values (6) | 592076 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5755079 | |
| Connector Punctuation | 391554 | 6.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1283998 | |
| R | 759432 | |
| I | 713403 | |
| C | 507012 | 8.8% |
| E | 435489 | 7.6% |
| N | 402781 | 7.0% |
| T | 397366 | 6.9% |
| M | 391554 | 6.8% |
| O | 271968 | 4.7% |
| H | 245840 | 4.3% |
| Other values (5) | 346236 | 6.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 391554 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5755079 | |
| Common | 391554 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1283998 | |
| R | 759432 | |
| I | 713403 | |
| C | 507012 | 8.8% |
| E | 435489 | 7.6% |
| N | 402781 | 7.0% |
| T | 397366 | 6.9% |
| M | 391554 | 6.8% |
| O | 271968 | 4.7% |
| H | 245840 | 4.3% |
| Other values (5) | 346236 | 6.0% |
Common
| Value | Count | Frequency (%) |
| _ | 391554 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6146633 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1283998 | |
| R | 759432 | |
| I | 713403 | |
| C | 507012 | 8.2% |
| E | 435489 | 7.1% |
| N | 402781 | 6.6% |
| T | 397366 | 6.5% |
| _ | 391554 | 6.4% |
| M | 391554 | 6.4% |
| O | 271968 | 4.4% |
| Other values (6) | 592076 |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 1202902 | |
| A | 1202902 | |
| N | 601451 | |
| O | 601451 | |
| T | 601451 | |
| H | 601451 | |
| _ | 601451 | |
| M | 601451 | |
| E | 601451 | |
| I | 601451 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7217412 | |
| Connector Punctuation | 601451 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1202902 | |
| A | 1202902 | |
| N | 601451 | |
| O | 601451 | |
| T | 601451 | |
| H | 601451 | |
| M | 601451 | |
| E | 601451 | |
| I | 601451 | |
| C | 601451 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 601451 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7217412 | |
| Common | 601451 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 1202902 | |
| A | 1202902 | |
| N | 601451 | |
| O | 601451 | |
| T | 601451 | |
| H | 601451 | |
| M | 601451 | |
| E | 601451 | |
| I | 601451 | |
| C | 601451 |
Common
| Value | Count | Frequency (%) |
| _ | 601451 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7818863 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 1202902 | |
| A | 1202902 | |
| N | 601451 | |
| O | 601451 | |
| T | 601451 | |
| H | 601451 | |
| _ | 601451 | |
| M | 601451 | |
| E | 601451 | |
| I | 601451 |
level0Gid
Text
Missing 
| Distinct | 157 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 473902 |
| Missing (%) | 78.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 22 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | VEN |
|---|---|
| 2nd row | AFG |
| 3rd row | Z01 |
| 4th row | VEN |
| 5th row | ZAF |
| Value | Count | Frequency (%) |
| ven | 22481 | |
| usa | 11290 | 8.9% |
| zaf | 9365 | 7.3% |
| gha | 6969 | 5.5% |
| mar | 6781 | 5.3% |
| idn | 6468 | 5.1% |
| bwa | 4488 | 3.5% |
| bfa | 4128 | 3.2% |
| moz | 3329 | 2.6% |
| pan | 3025 | 2.4% |
| Other values (147) | 49225 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 57657 | |
| N | 44072 | 11.5% |
| E | 35574 | 9.3% |
| V | 25784 | 6.7% |
| M | 21228 | 5.5% |
| S | 19507 | 5.1% |
| Z | 16511 | 4.3% |
| G | 16274 | 4.3% |
| F | 15976 | 4.2% |
| B | 15801 | 4.1% |
| Other values (19) | 114263 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 379759 | |
| Decimal Number | 2888 | 0.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 57657 | |
| N | 44072 | 11.6% |
| E | 35574 | 9.4% |
| V | 25784 | 6.8% |
| M | 21228 | 5.6% |
| S | 19507 | 5.1% |
| Z | 16511 | 4.3% |
| G | 16274 | 4.3% |
| F | 15976 | 4.2% |
| B | 15801 | 4.2% |
| Other values (16) | 111375 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1444 | |
| 1 | 1146 | |
| 6 | 298 | 10.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 379759 | |
| Common | 2888 | 0.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 57657 | |
| N | 44072 | 11.6% |
| E | 35574 | 9.4% |
| V | 25784 | 6.8% |
| M | 21228 | 5.6% |
| S | 19507 | 5.1% |
| Z | 16511 | 4.3% |
| G | 16274 | 4.3% |
| F | 15976 | 4.2% |
| B | 15801 | 4.2% |
| Other values (16) | 111375 |
Common
| Value | Count | Frequency (%) |
| 0 | 1444 | |
| 1 | 1146 | |
| 6 | 298 | 10.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 382647 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 57657 | |
| N | 44072 | 11.5% |
| E | 35574 | 9.3% |
| V | 25784 | 6.7% |
| M | 21228 | 5.5% |
| S | 19507 | 5.1% |
| Z | 16511 | 4.3% |
| G | 16274 | 4.3% |
| F | 15976 | 4.2% |
| B | 15801 | 4.1% |
| Other values (19) | 114263 |
level0Name
Text
Missing 
| Distinct | 157 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 473902 |
| Missing (%) | 78.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 24 |
| Mean length | 9.472931971 |
| Min length | 4 |
Unique
| Unique | 22 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Venezuela |
|---|---|
| 2nd row | Afghanistan |
| 3rd row | Jammu and Kashmir |
| 4th row | Venezuela |
| 5th row | South Africa |
| Value | Count | Frequency (%) |
| venezuela | 22481 | 13.1% |
| united | 11945 | 7.0% |
| states | 11376 | 6.6% |
| south | 10173 | 5.9% |
| africa | 9365 | 5.5% |
| ghana | 6969 | 4.1% |
| morocco | 6781 | 3.9% |
| indonesia | 6468 | 3.8% |
| botswana | 4488 | 2.6% |
| burkina | 4128 | 2.4% |
| Other values (185) | 77505 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 157852 | 13.1% |
| e | 138859 | 11.5% |
| n | 95632 | 7.9% |
| i | 91451 | 7.6% |
| o | 66935 | 5.5% |
| t | 62305 | 5.2% |
| u | 52405 | 4.3% |
| r | 46349 | 3.8% |
| 44130 | 3.7% | |
| l | 37921 | 3.1% |
| Other values (49) | 414424 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 992249 | |
| Uppercase Letter | 168966 | 14.0% |
| Space Separator | 44130 | 3.7% |
| Other Punctuation | 2902 | 0.2% |
| Dash Punctuation | 14 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 157852 | |
| e | 138859 | |
| n | 95632 | |
| i | 91451 | |
| o | 66935 | 6.7% |
| t | 62305 | 6.3% |
| u | 52405 | 5.3% |
| r | 46349 | 4.7% |
| l | 37921 | 3.8% |
| s | 37179 | 3.7% |
| Other values (19) | 205361 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 25721 | |
| V | 22909 | |
| M | 16407 | |
| B | 14067 | |
| A | 12684 | |
| U | 12397 | |
| G | 10000 | 5.9% |
| I | 9818 | 5.8% |
| P | 7377 | 4.4% |
| C | 7240 | 4.3% |
| Other values (13) | 30346 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 2873 | |
| . | 18 | 0.6% |
| , | 11 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 44130 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1161215 | |
| Common | 47048 | 3.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 157852 | |
| e | 138859 | 12.0% |
| n | 95632 | 8.2% |
| i | 91451 | 7.9% |
| o | 66935 | 5.8% |
| t | 62305 | 5.4% |
| u | 52405 | 4.5% |
| r | 46349 | 4.0% |
| l | 37921 | 3.3% |
| s | 37179 | 3.2% |
| Other values (42) | 374327 |
Common
| Value | Count | Frequency (%) |
| 44130 | ||
| ' | 2873 | 6.1% |
| . | 18 | < 0.1% |
| - | 14 | < 0.1% |
| , | 11 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1205267 | |
| None | 2996 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 157852 | 13.1% |
| e | 138859 | 11.5% |
| n | 95632 | 7.9% |
| i | 91451 | 7.6% |
| o | 66935 | 5.6% |
| t | 62305 | 5.2% |
| u | 52405 | 4.3% |
| r | 46349 | 3.8% |
| 44130 | 3.7% | |
| l | 37921 | 3.1% |
| Other values (46) | 411428 |
None
| Value | Count | Frequency (%) |
| ô | 2873 | |
| é | 122 | 4.1% |
| ç | 1 | < 0.1% |
level1Gid
Text
Missing 
| Distinct | 906 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 473930 |
| Missing (%) | 78.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.41927212 |
| Min length | 6 |
Unique
| Unique | 191 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | VEN.6_1 |
|---|---|
| 2nd row | AFG.15_1 |
| 3rd row | Z01.14_1 |
| 4th row | VEN.1_1 |
| 5th row | ZAF.8_1 |
| Value | Count | Frequency (%) |
| ven.1_1 | 6194 | 4.9% |
| zaf.8_1 | 3031 | 2.4% |
| ven.6_1 | 2186 | 1.7% |
| bwa.12_1 | 2159 | 1.7% |
| ven.12_1 | 1504 | 1.2% |
| caf.16_1 | 1500 | 1.2% |
| eth.8_1 | 1491 | 1.2% |
| mar.6_1 | 1470 | 1.2% |
| ven.24_1 | 1465 | 1.1% |
| mar.12_1 | 1449 | 1.1% |
| Other values (896) | 105072 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 179942 | |
| _ | 127518 | |
| . | 120552 | |
| A | 57624 | 6.1% |
| N | 44072 | 4.7% |
| 2 | 40011 | 4.2% |
| E | 35574 | 3.8% |
| V | 25784 | 2.7% |
| M | 21227 | 2.2% |
| 4 | 20211 | 2.1% |
| Other values (28) | 273598 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 379684 | |
| Decimal Number | 318359 | |
| Connector Punctuation | 127518 | 13.5% |
| Other Punctuation | 120552 | 12.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 57624 | |
| N | 44072 | 11.6% |
| E | 35574 | 9.4% |
| V | 25784 | 6.8% |
| M | 21227 | 5.6% |
| S | 19503 | 5.1% |
| Z | 16511 | 4.3% |
| G | 16272 | 4.3% |
| F | 15975 | 4.2% |
| B | 15797 | 4.2% |
| Other values (16) | 111345 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 179942 | |
| 2 | 40011 | 12.6% |
| 4 | 20211 | 6.3% |
| 3 | 16182 | 5.1% |
| 6 | 13323 | 4.2% |
| 5 | 12363 | 3.9% |
| 0 | 10706 | 3.4% |
| 8 | 10060 | 3.2% |
| 7 | 8024 | 2.5% |
| 9 | 7537 | 2.4% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 127518 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 120552 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 566429 | |
| Latin | 379684 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 57624 | |
| N | 44072 | 11.6% |
| E | 35574 | 9.4% |
| V | 25784 | 6.8% |
| M | 21227 | 5.6% |
| S | 19503 | 5.1% |
| Z | 16511 | 4.3% |
| G | 16272 | 4.3% |
| F | 15975 | 4.2% |
| B | 15797 | 4.2% |
| Other values (16) | 111345 |
Common
| Value | Count | Frequency (%) |
| 1 | 179942 | |
| _ | 127518 | |
| . | 120552 | |
| 2 | 40011 | 7.1% |
| 4 | 20211 | 3.6% |
| 3 | 16182 | 2.9% |
| 6 | 13323 | 2.4% |
| 5 | 12363 | 2.2% |
| 0 | 10706 | 1.9% |
| 8 | 10060 | 1.8% |
| Other values (2) | 15561 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 946113 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 179942 | |
| _ | 127518 | |
| . | 120552 | |
| A | 57624 | 6.1% |
| N | 44072 | 4.7% |
| 2 | 40011 | 4.2% |
| E | 35574 | 3.8% |
| V | 25784 | 2.7% |
| M | 21227 | 2.2% |
| 4 | 20211 | 2.1% |
| Other values (28) | 273598 |
level1Name
Text
Missing 
| Distinct | 882 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 473930 |
| Missing (%) | 78.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 9.408136699 |
| Min length | 3 |
Unique
| Unique | 187 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Bolívar |
|---|---|
| 2nd row | Kandahar |
| 3rd row | Jammu and Kashmir |
| 4th row | Amazonas |
| 5th row | Northern Cape |
| Value | Count | Frequency (%) |
| 8657 | 4.8% | |
| amazonas | 6326 | 3.5% |
| cape | 5248 | 2.9% |
| northern | 4629 | 2.6% |
| eastern | 4155 | 2.3% |
| bolívar | 2189 | 1.2% |
| north-west | 2159 | 1.2% |
| barat | 2126 | 1.2% |
| west | 1913 | 1.1% |
| western | 1800 | 1.0% |
| Other values (1015) | 140238 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 186875 | |
| r | 89084 | 7.4% |
| n | 79844 | 6.7% |
| e | 79607 | 6.6% |
| o | 67186 | 5.6% |
| t | 58756 | 4.9% |
| i | 53839 | 4.5% |
| s | 53114 | 4.4% |
| 51919 | 4.3% | |
| l | 43110 | 3.6% |
| Other values (80) | 436401 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 949271 | |
| Uppercase Letter | 178927 | 14.9% |
| Space Separator | 51919 | 4.3% |
| Dash Punctuation | 19040 | 1.6% |
| Other Punctuation | 578 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 186875 | |
| r | 89084 | |
| n | 79844 | |
| e | 79607 | |
| o | 67186 | 7.1% |
| t | 58756 | 6.2% |
| i | 53839 | 5.7% |
| s | 53114 | 5.6% |
| l | 43110 | 4.5% |
| u | 40632 | 4.3% |
| Other values (47) | 197224 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 17121 | 9.6% |
| C | 16461 | 9.2% |
| N | 15709 | 8.8% |
| A | 15683 | 8.8% |
| M | 14877 | 8.3% |
| B | 11765 | 6.6% |
| T | 11508 | 6.4% |
| E | 10199 | 5.7% |
| K | 8533 | 4.8% |
| W | 6887 | 3.8% |
| Other values (18) | 50184 |
Other Punctuation
| Value | Count | Frequency (%) |
| ! | 398 | |
| ' | 106 | 18.3% |
| , | 74 | 12.8% |
Space Separator
| Value | Count | Frequency (%) |
| 51919 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19040 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1128198 | |
| Common | 71537 | 6.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 186875 | |
| r | 89084 | 7.9% |
| n | 79844 | 7.1% |
| e | 79607 | 7.1% |
| o | 67186 | 6.0% |
| t | 58756 | 5.2% |
| i | 53839 | 4.8% |
| s | 53114 | 4.7% |
| l | 43110 | 3.8% |
| u | 40632 | 3.6% |
| Other values (75) | 376151 |
Common
| Value | Count | Frequency (%) |
| 51919 | ||
| - | 19040 | 26.6% |
| ! | 398 | 0.6% |
| ' | 106 | 0.1% |
| , | 74 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1180925 | |
| None | 18427 | 1.5% |
| Latin Ext Additional | 383 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 186875 | |
| r | 89084 | 7.5% |
| n | 79844 | 6.8% |
| e | 79607 | 6.7% |
| o | 67186 | 5.7% |
| t | 58756 | 5.0% |
| i | 53839 | 4.6% |
| s | 53114 | 4.5% |
| 51919 | 4.4% | |
| l | 43110 | 3.7% |
| Other values (47) | 417591 |
None
| Value | Count | Frequency (%) |
| é | 6270 | |
| á | 3586 | |
| í | 3080 | |
| ó | 2234 | 12.1% |
| â | 1779 | 9.7% |
| è | 833 | 4.5% |
| Đ | 250 | 1.4% |
| ô | 144 | 0.8% |
| à | 101 | 0.5% |
| ò | 51 | 0.3% |
| Other values (14) | 99 | 0.5% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ồ | 132 | |
| ẵ | 92 | |
| ắ | 60 | |
| ậ | 57 | |
| ả | 19 | 5.0% |
| ị | 11 | 2.9% |
| ế | 5 | 1.3% |
| ừ | 5 | 1.3% |
| ệ | 2 | 0.5% |
level2Gid
Text
Missing 
| Distinct | 2378 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 475037 |
| Missing (%) | 79.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 9.659871533 |
| Min length | 7 |
Unique
| Unique | 658 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | VEN.6.10_1 |
|---|---|
| 2nd row | AFG.15.3_1 |
| 3rd row | Z01.14.3_1 |
| 4th row | VEN.1.6_1 |
| 5th row | ZAF.8.5_1 |
| Value | Count | Frequency (%) |
| ven.1.5_1 | 2542 | 2.0% |
| bwa.12.2_1 | 1980 | 1.6% |
| ven.1.1_1 | 1644 | 1.3% |
| caf.16.2_1 | 1500 | 1.2% |
| eth.8.8_1 | 1196 | 0.9% |
| zaf.8.5_1 | 1077 | 0.9% |
| ven.6.10_1 | 1052 | 0.8% |
| sle.2.1_1 | 1049 | 0.8% |
| sle.1.2_1 | 1037 | 0.8% |
| zaf.8.4_1 | 1035 | 0.8% |
| Other values (2368) | 112302 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 245856 | |
| 1 | 216878 | |
| _ | 126414 | 10.4% |
| 2 | 73592 | 6.0% |
| A | 57593 | 4.7% |
| N | 44067 | 3.6% |
| E | 35567 | 2.9% |
| 4 | 35248 | 2.9% |
| 3 | 33422 | 2.7% |
| 5 | 26946 | 2.2% |
| Other values (28) | 325560 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 472519 | |
| Uppercase Letter | 376354 | |
| Other Punctuation | 245856 | |
| Connector Punctuation | 126414 | 10.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 57593 | |
| N | 44067 | |
| E | 35567 | 9.5% |
| V | 25781 | 6.9% |
| M | 21036 | 5.6% |
| S | 19455 | 5.2% |
| Z | 16332 | 4.3% |
| G | 16226 | 4.3% |
| F | 15974 | 4.2% |
| B | 15301 | 4.1% |
| Other values (16) | 109022 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 216878 | |
| 2 | 73592 | 15.6% |
| 4 | 35248 | 7.5% |
| 3 | 33422 | 7.1% |
| 5 | 26946 | 5.7% |
| 6 | 22912 | 4.8% |
| 0 | 17324 | 3.7% |
| 8 | 16328 | 3.5% |
| 7 | 15121 | 3.2% |
| 9 | 14748 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 245856 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 126414 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 844789 | |
| Latin | 376354 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 57593 | |
| N | 44067 | |
| E | 35567 | 9.5% |
| V | 25781 | 6.9% |
| M | 21036 | 5.6% |
| S | 19455 | 5.2% |
| Z | 16332 | 4.3% |
| G | 16226 | 4.3% |
| F | 15974 | 4.2% |
| B | 15301 | 4.1% |
| Other values (16) | 109022 |
Common
| Value | Count | Frequency (%) |
| . | 245856 | |
| 1 | 216878 | |
| _ | 126414 | |
| 2 | 73592 | 8.7% |
| 4 | 35248 | 4.2% |
| 3 | 33422 | 4.0% |
| 5 | 26946 | 3.2% |
| 6 | 22912 | 2.7% |
| 0 | 17324 | 2.1% |
| 8 | 16328 | 1.9% |
| Other values (2) | 29869 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1221143 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 245856 | |
| 1 | 216878 | |
| _ | 126414 | 10.4% |
| 2 | 73592 | 6.0% |
| A | 57593 | 4.7% |
| N | 44067 | 3.6% |
| E | 35567 | 2.9% |
| 4 | 35248 | 2.9% |
| 3 | 33422 | 2.7% |
| 5 | 26946 | 2.2% |
| Other values (28) | 325560 |
level2Name
Text
Missing 
| Distinct | 2276 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 475037 |
| Missing (%) | 79.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 8.596326356 |
| Min length | 2 |
Unique
| Unique | 606 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Sifontes |
|---|---|
| 2nd row | Daman |
| 3rd row | Bandipore |
| 4th row | Maroa |
| 5th row | Siyanda |
| Value | Count | Frequency (%) |
| west | 3557 | 2.1% |
| manapiare | 2542 | 1.5% |
| ngamiland | 2159 | 1.3% |
| south | 2001 | 1.2% |
| alto | 1647 | 1.0% |
| orinoco | 1644 | 1.0% |
| east | 1641 | 1.0% |
| nola | 1500 | 0.9% |
| bolívar | 1475 | 0.9% |
| miranda | 1344 | 0.8% |
| Other values (2539) | 146175 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 156174 | 14.4% |
| o | 74427 | 6.8% |
| n | 72702 | 6.7% |
| e | 72325 | 6.7% |
| i | 68255 | 6.3% |
| r | 58313 | 5.4% |
| t | 45881 | 4.2% |
| u | 41995 | 3.9% |
| 39271 | 3.6% | |
| l | 37304 | 3.4% |
| Other values (111) | 420049 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 874904 | |
| Uppercase Letter | 166128 | 15.3% |
| Space Separator | 39271 | 3.6% |
| Dash Punctuation | 3492 | 0.3% |
| Other Punctuation | 1688 | 0.2% |
| Decimal Number | 661 | 0.1% |
| Open Punctuation | 293 | < 0.1% |
| Close Punctuation | 259 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 156174 | |
| o | 74427 | 8.5% |
| n | 72702 | 8.3% |
| e | 72325 | 8.3% |
| i | 68255 | 7.8% |
| r | 58313 | 6.7% |
| t | 45881 | 5.2% |
| u | 41995 | 4.8% |
| l | 37304 | 4.3% |
| s | 33386 | 3.8% |
| Other values (59) | 214142 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 17317 | 10.4% |
| M | 16983 | 10.2% |
| B | 14294 | 8.6% |
| A | 13999 | 8.4% |
| C | 12443 | 7.5% |
| K | 12120 | 7.3% |
| N | 10665 | 6.4% |
| T | 9065 | 5.5% |
| G | 7232 | 4.4% |
| P | 6475 | 3.9% |
| Other values (24) | 45535 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 386 | |
| 0 | 164 | |
| 3 | 52 | 7.9% |
| 7 | 37 | 5.6% |
| 5 | 9 | 1.4% |
| 2 | 5 | 0.8% |
| 8 | 3 | 0.5% |
| 9 | 2 | 0.3% |
| 6 | 2 | 0.3% |
| 4 | 1 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 823 | |
| / | 453 | |
| . | 408 | |
| , | 4 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 39271 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3492 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 293 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 259 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1041032 | |
| Common | 45664 | 4.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 156174 | |
| o | 74427 | 7.1% |
| n | 72702 | 7.0% |
| e | 72325 | 6.9% |
| i | 68255 | 6.6% |
| r | 58313 | 5.6% |
| t | 45881 | 4.4% |
| u | 41995 | 4.0% |
| l | 37304 | 3.6% |
| s | 33386 | 3.2% |
| Other values (93) | 380270 |
Common
| Value | Count | Frequency (%) |
| 39271 | ||
| - | 3492 | 7.6% |
| ' | 823 | 1.8% |
| / | 453 | 1.0% |
| . | 408 | 0.9% |
| 1 | 386 | 0.8% |
| ( | 293 | 0.6% |
| ) | 259 | 0.6% |
| 0 | 164 | 0.4% |
| 3 | 52 | 0.1% |
| Other values (8) | 63 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1067404 | |
| None | 19124 | 1.8% |
| Latin Ext Additional | 168 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 156174 | 14.6% |
| o | 74427 | 7.0% |
| n | 72702 | 6.8% |
| e | 72325 | 6.8% |
| i | 68255 | 6.4% |
| r | 58313 | 5.5% |
| t | 45881 | 4.3% |
| u | 41995 | 3.9% |
| 39271 | 3.7% | |
| l | 37304 | 3.5% |
| Other values (60) | 400757 |
None
| Value | Count | Frequency (%) |
| é | 5729 | |
| á | 3632 | |
| í | 3054 | |
| ú | 1751 | 9.2% |
| ó | 1611 | 8.4% |
| è | 1050 | 5.5% |
| ô | 559 | 2.9% |
| ñ | 346 | 1.8% |
| â | 323 | 1.7% |
| É | 204 | 1.1% |
| Other values (29) | 865 | 4.5% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 49 | |
| ộ | 36 | |
| ạ | 35 | |
| ậ | 15 | 8.9% |
| ắ | 9 | 5.4% |
| ồ | 5 | 3.0% |
| ọ | 5 | 3.0% |
| ấ | 4 | 2.4% |
| ỏ | 4 | 2.4% |
| ị | 3 | 1.8% |
| Other values (2) | 3 | 1.8% |
level3Gid
Text
Missing 
| Distinct | 1589 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 539154 |
| Missing (%) | 89.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 11.6395011 |
| Min length | 11 |
Unique
| Unique | 471 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Z01.14.3.1_1 |
|---|---|
| 2nd row | ZAF.8.5.3_1 |
| 3rd row | Z06.6.1.4_1 |
| 4th row | BFA.8.2.6_1 |
| 5th row | PHL.59.10.11_1 |
| Value | Count | Frequency (%) |
| sle.1.2.8_1 | 1037 | 1.7% |
| eth.8.8.11_1 | 988 | 1.6% |
| pan.11.1.1_1 | 727 | 1.2% |
| sle.2.1.13_1 | 717 | 1.2% |
| mar.6.2.2_1 | 637 | 1.0% |
| pan.2.10.3_1 | 610 | 1.0% |
| ssd.1.2.1_1 | 426 | 0.7% |
| zaf.8.5.3_1 | 419 | 0.7% |
| ben.2.5.2_1 | 418 | 0.7% |
| pan.4.2.6_1 | 413 | 0.7% |
| Other values (1579) | 55905 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 186891 | |
| 1 | 125870 | |
| _ | 62297 | 8.6% |
| 2 | 41681 | 5.7% |
| 3 | 25313 | 3.5% |
| A | 25141 | 3.5% |
| 4 | 22242 | 3.1% |
| 5 | 18614 | 2.6% |
| 6 | 15803 | 2.2% |
| Z | 15767 | 2.2% |
| Other values (24) | 185487 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 291915 | |
| Other Punctuation | 186891 | |
| Uppercase Letter | 184003 | |
| Connector Punctuation | 62297 | 8.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 25141 | |
| Z | 15767 | 8.6% |
| N | 15438 | 8.4% |
| F | 13775 | 7.5% |
| M | 13210 | 7.2% |
| E | 12819 | 7.0% |
| R | 10146 | 5.5% |
| I | 10093 | 5.5% |
| D | 8484 | 4.6% |
| B | 8385 | 4.6% |
| Other values (12) | 50745 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 125870 | |
| 2 | 41681 | 14.3% |
| 3 | 25313 | 8.7% |
| 4 | 22242 | 7.6% |
| 5 | 18614 | 6.4% |
| 6 | 15803 | 5.4% |
| 8 | 12708 | 4.4% |
| 0 | 11747 | 4.0% |
| 9 | 9752 | 3.3% |
| 7 | 8185 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 186891 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 62297 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 541103 | |
| Latin | 184003 | 25.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 25141 | |
| Z | 15767 | 8.6% |
| N | 15438 | 8.4% |
| F | 13775 | 7.5% |
| M | 13210 | 7.2% |
| E | 12819 | 7.0% |
| R | 10146 | 5.5% |
| I | 10093 | 5.5% |
| D | 8484 | 4.6% |
| B | 8385 | 4.6% |
| Other values (12) | 50745 |
Common
| Value | Count | Frequency (%) |
| . | 186891 | |
| 1 | 125870 | |
| _ | 62297 | 11.5% |
| 2 | 41681 | 7.7% |
| 3 | 25313 | 4.7% |
| 4 | 22242 | 4.1% |
| 5 | 18614 | 3.4% |
| 6 | 15803 | 2.9% |
| 8 | 12708 | 2.3% |
| 0 | 11747 | 2.2% |
| Other values (2) | 17937 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 725106 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 186891 | |
| 1 | 125870 | |
| _ | 62297 | 8.6% |
| 2 | 41681 | 5.7% |
| 3 | 25313 | 3.5% |
| A | 25141 | 3.5% |
| 4 | 22242 | 3.1% |
| 5 | 18614 | 2.6% |
| 6 | 15803 | 2.2% |
| Z | 15767 | 2.2% |
| Other values (24) | 185487 |
level3Name
Text
Missing 
| Distinct | 1550 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 539390 |
| Missing (%) | 89.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 9.01318058 |
| Min length | 2 |
Unique
| Unique | 451 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | n.a. ( 4) |
|---|---|
| 2nd row | Kai !Garib |
| 3rd row | Kargil |
| 4th row | Yamba |
| 5th row | Malaking Patag |
| Value | Count | Frequency (%) |
| ward | 1627 | 1.9% |
| n.a | 1294 | 1.5% |
| 1255 | 1.5% | |
| lower | 1037 | 1.2% |
| bambara | 1037 | 1.2% |
| seka | 993 | 1.1% |
| chekorsa | 988 | 1.1% |
| na | 839 | 1.0% |
| arraiján | 727 | 0.8% |
| tambakha | 717 | 0.8% |
| Other values (1794) | 75841 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 81980 | 14.7% |
| i | 36730 | 6.6% |
| n | 36575 | 6.5% |
| o | 35678 | 6.4% |
| e | 34413 | 6.2% |
| r | 27496 | 4.9% |
| u | 26462 | 4.7% |
| 24294 | 4.3% | |
| g | 17364 | 3.1% |
| l | 17126 | 3.1% |
| Other values (97) | 221249 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 434638 | |
| Uppercase Letter | 83597 | 14.9% |
| Space Separator | 24294 | 4.3% |
| Other Punctuation | 5254 | 0.9% |
| Decimal Number | 4700 | 0.8% |
| Open Punctuation | 2638 | 0.5% |
| Close Punctuation | 2632 | 0.5% |
| Dash Punctuation | 1614 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 81980 | |
| i | 36730 | 8.5% |
| n | 36575 | 8.4% |
| o | 35678 | 8.2% |
| e | 34413 | 7.9% |
| r | 27496 | 6.3% |
| u | 26462 | 6.1% |
| g | 17364 | 4.0% |
| l | 17126 | 3.9% |
| m | 14811 | 3.4% |
| Other values (51) | 106003 |
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 7528 | 9.0% |
| S | 7526 | 9.0% |
| T | 6850 | 8.2% |
| B | 6823 | 8.2% |
| M | 6606 | 7.9% |
| A | 5229 | 6.3% |
| C | 5171 | 6.2% |
| G | 5052 | 6.0% |
| N | 4059 | 4.9% |
| L | 3952 | 4.7% |
| Other values (17) | 24801 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1266 | |
| 2 | 536 | |
| 7 | 498 | 10.6% |
| 9 | 455 | 9.7% |
| 6 | 384 | 8.2% |
| 3 | 380 | 8.1% |
| 0 | 346 | 7.4% |
| 4 | 333 | 7.1% |
| 8 | 312 | 6.6% |
| 5 | 190 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3068 | |
| / | 1029 | 19.6% |
| ! | 638 | 12.1% |
| ' | 317 | 6.0% |
| , | 202 | 3.8% |
Space Separator
| Value | Count | Frequency (%) |
| 24294 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2638 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2632 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1614 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 518235 | |
| Common | 41132 | 7.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 81980 | |
| i | 36730 | 7.1% |
| n | 36575 | 7.1% |
| o | 35678 | 6.9% |
| e | 34413 | 6.6% |
| r | 27496 | 5.3% |
| u | 26462 | 5.1% |
| g | 17364 | 3.4% |
| l | 17126 | 3.3% |
| m | 14811 | 2.9% |
| Other values (78) | 189600 |
Common
| Value | Count | Frequency (%) |
| 24294 | ||
| . | 3068 | 7.5% |
| ( | 2638 | 6.4% |
| ) | 2632 | 6.4% |
| - | 1614 | 3.9% |
| 1 | 1266 | 3.1% |
| / | 1029 | 2.5% |
| ! | 638 | 1.6% |
| 2 | 536 | 1.3% |
| 7 | 498 | 1.2% |
| Other values (9) | 2919 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 554144 | |
| None | 4929 | 0.9% |
| Latin Ext Additional | 294 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 81980 | 14.8% |
| i | 36730 | 6.6% |
| n | 36575 | 6.6% |
| o | 35678 | 6.4% |
| e | 34413 | 6.2% |
| r | 27496 | 5.0% |
| u | 26462 | 4.8% |
| 24294 | 4.4% | |
| g | 17364 | 3.1% |
| l | 17126 | 3.1% |
| Other values (61) | 216026 |
None
| Value | Count | Frequency (%) |
| é | 2169 | |
| á | 961 | |
| è | 846 | 17.2% |
| ó | 452 | 9.2% |
| ñ | 162 | 3.3% |
| â | 63 | 1.3% |
| ơ | 63 | 1.3% |
| ú | 48 | 1.0% |
| ư | 29 | 0.6% |
| Đ | 22 | 0.4% |
| Other values (13) | 114 | 2.3% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ọ | 78 | |
| ả | 46 | |
| ớ | 35 | |
| ạ | 33 | |
| ế | 21 | 7.1% |
| ố | 19 | 6.5% |
| ậ | 12 | 4.1% |
| ờ | 11 | 3.7% |
| ỹ | 10 | 3.4% |
| ắ | 9 | 3.1% |
| Other values (3) | 20 | 6.8% |
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 210302 |
| Missing (%) | 35.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LC |
|---|---|
| 2nd row | LC |
| 3rd row | LC |
| 4th row | LC |
| 5th row | LC |
| Value | Count | Frequency (%) |
| lc | 316013 | |
| ne | 32299 | 8.3% |
| vu | 20062 | 5.1% |
| nt | 8397 | 2.1% |
| en | 8040 | 2.1% |
| dd | 3578 | 0.9% |
| cr | 2355 | 0.6% |
| ex | 375 | 0.1% |
| ew | 30 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 318368 | |
| L | 316013 | |
| N | 48736 | 6.2% |
| E | 40744 | 5.2% |
| V | 20062 | 2.6% |
| U | 20062 | 2.6% |
| T | 8397 | 1.1% |
| D | 7156 | 0.9% |
| R | 2355 | 0.3% |
| X | 375 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 782298 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 318368 | |
| L | 316013 | |
| N | 48736 | 6.2% |
| E | 40744 | 5.2% |
| V | 20062 | 2.6% |
| U | 20062 | 2.6% |
| T | 8397 | 1.1% |
| D | 7156 | 0.9% |
| R | 2355 | 0.3% |
| X | 375 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 782298 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 318368 | |
| L | 316013 | |
| N | 48736 | 6.2% |
| E | 40744 | 5.2% |
| V | 20062 | 2.6% |
| U | 20062 | 2.6% |
| T | 8397 | 1.1% |
| D | 7156 | 0.9% |
| R | 2355 | 0.3% |
| X | 375 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 782298 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 318368 | |
| L | 316013 | |
| N | 48736 | 6.2% |
| E | 40744 | 5.2% |
| V | 20062 | 2.6% |
| U | 20062 | 2.6% |
| T | 8397 | 1.1% |
| D | 7156 | 0.9% |
| R | 2355 | 0.3% |
| X | 375 | < 0.1% |